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CIp-protease as target for herbicides 

The present invention relates to Clp-protease, which, when absent, brings about re- 
duced growth and chlorotic leaves as target for herbicides. For this purpose, novel nu- 
5 cleic acid sequences encompassing SEQ ID NO:3, SEQ ID NO: 1 1 and SEQ ID NO: 17 
and functional equivalents of SEQ ID NO:3, SEQ ID NO:1 1 and SEQ ID NO: 17 are 
provided. Moreover, the present invention relates to the use of Clp-protease in a 
method for identifying compounds with herbicidal or growth-regulatory activity, and to 
the use of the compounds identified by this method as herbicides or growth regulators. 



15 



The basic principle of identifying herbicides via the inhibition of a defined target is 
known (for example US 5,187,071, WO 98/33925, WO 00/77185). In general, there is a 
great demand for the detection of enzymes which might constitute novel targets for 
herbicides. The reasons are resistance problems which occur with herbicidal active 
ingredients which act on known targets, and the ongoing endeavor to identify novel 
herbicidal active ingredients which are distinguished by as wide as possible a spectrum 
of action, ecological and toxicological acceptability and/or low application rates. 

In practice, the detection of novel targets entails great difficulties since the inhibition of 
20 an enzyme which forms part of a metabolic pathway frequently has no further effect on 
the growth of the plant This may be attributed to the fact that the plant switches to al- 
ternative metabolic pathways whose existence is not known or that the inhibited en- 
zyme is not limiting for the metabolic pathway Furthermore, plant genomes are 
distinguished by a high degree of functional redundancy. Functionally equivalent 
25 enzymes are found more frequently in gene families in the Arabidopsis thaliana 
genome than in Insects or mammals (Nature, 2000, 408(6814):796-815). This 
hypothesis is confirmed experimentally by the fact that comprehensive gene knock-out 
programs by T-DNA or transposon insertion into Arabidopsis yielded fewer manifested 
phenotypes to date than expected (Curr. Op. Plant Biol. 4, 2001, pp.1 11-1 17). 

It is an object of the present invention to identify novel targets which are essential for 
the growth of plants or whose inhibition leads to reduced plant growth, and to provide 
methods which are suitable for identifying herbicidally active and/or growth-regulatory 
compounds. 

We have found that this object is achieved by the use of nuclear encoded Clp-pratease 
in a method for identifying herbicides. 

Further terms used in the description are now defined at this point. 

"Affinity tag*: this refers to a peptide or polypeptide whose coding nucleic acid se- 
quence can be fused to the nucleic acid sequence according to the invention either 
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directly or by means of a linker, using customary cloning techniques. The affinity tag 
serves for the isolation, concentration and/or selective purification of the recombinant 
target protein by means of affinity chromatography from total cell extracts. The above- 
mentioned linker can advantageously contain a protease cleavage site (for example for 
5 thrombin or factor Xa), whereby the affinity tag can be cleaved from the target protein 
when required. Examples of common affinity tags are the "His tag", for example from 
Qiagen, Hilden, "Strep tag*, the "Myo tag* (Invitragen, Carlsbeng), the tag from New 
England Biolabs which consists of a chitin-bihding domain and an inteine, the maltose- 
binding protein (pMal) from New England Biolabs, and what is known as the CBD tag 
10 from Novagen. In this context, the affinity tag can be attached to the 5' or the 3' end of 
the coding nucleic acid sequence with the sequence encoding the target protein. 

"Activity of nuclear encoded Op-protease": the term activity describes the ability of an 
enzyme to convert a substrate into a product. The enzymatic activity can be deter- 

1 5 mined in what is known as an activity assay via the increase in the product, the de- 
crease in the substrate (or starting material) or the decrease in a specific cofactor, or 
via a combination of at least two of the abovementioned parameters, as a function of a 
defined period of time. k Activity of nuclear encoded Clp-protease" describes here the 
ability of an enzyme to catalyze the hydrolysis of peptides of maximal five amino acids 

20 in vitro. 

"Expression cassette": an expression cassette contains a nucleic acid sequence ac- 
cording to the invention linked operably to at least one genetic control element, such as 
a promoter, and, advantageously, a further control element, such as a tenminator. The 

25 nucleic acid sequence of the expression cassette can be for example a genomic or 
complementary DNA sequence or an RNA sequence, and their semisynthetic or fully 
synthetic analogs. These sequences can exist in linear or circular form, extrachromo- 
somally or integrated into the genome. The nucleic acid sequences in question can be 
synthesized or obtained naturally or contain a mixture of synthetic and natural DNA 

30 components, or else consist of various heterologous gene segments of various organ- 
isms. 

Artificial nucleic acid sequences are also suitable in this context as long as they make 
possible the expression, in a cell or an organism, of a polypeptide with the enzymatic 
35 activity of a nuclear encoded Clp Protease, preferably with the biological activity of a a 
nuclear encoded Clp Protease, which polypeptide is encoded by a nucleic acid se- 
quence according to the invention. For example, synthetic nucleotide sequences can 
be generated which have been optimized with regard to the codon usage of the organ- 
isms to be transformed. 

40 

AH of the abovementioned nucleotide sequences can be generated from the nucleotide 
units by chemical synthesis in the manner known per se, for example by fragment con- 
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densation of individual overlapping complementary nucleotide units of the double helix 
Oligonucleotides can be synthesized chemically for example in the manner known per 
se using the phosphoamidite method (Voet. Voet, 2nd Edition, Wiley Press New York 
pp. 896-897). When preparing an expression cassette, various DNA fragments can be 
manipulated in such a way that a nucleotide sequence with the correct direction of 
reading and the correct reading frame is obtained. The nucleic acid fragments are 
linked with each other via general cloning techniques as are described, for example, in 
T. Maniatis. E.F. Fritsch and J. Sambrook. "Molecular Cloning: A Laboratory Manual- 
Cold Spnng Harbor Laboratory. Cold Spring Harbor, NY (1989), in T.J. Silhavy M L ' 
Barman and L.W. Enquist. Experiments with Gene Fusions, Cold Spring Harbor Labo- 
ratory. Cold Spring Harbor. NY (1984) and in Ausubel. F.M. et al.. -Current Protocols in 
Molecular Biology, Greene Publishing Assoc. and Wiley-interscience (1994). 

"Operable linkage" or 'functional linkage": an operable, or functional, linkage is under- 
stood as meaning the sequential arrangement of regulatory sequences or genetic con- 
troi elements in such a way that each of the regulatory sequences, or each of the ge- 
net.c control elements, can fulfill its intended function when the coding sequence is 
expressed. 



Functional equivalents" describe, in the present context, nucleic acid sequences which 
hybndize under standard conditions with the nucleic acid sequence SEQ id NO-1 SEQ 
ID NO:3, SEQ ID NO:5. SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO* 
SEQ ID NO:13, SEQ ID NO:15 or SEQ ID NO:17 or parts of the aforementioned nu- 
cleic aod sequences and which are capable of bringing about the expression, in a cell 
25 or an organism, of a polypeptide with the activity of Clp protease. 

To cany out the hybridization, it is advantageous to use short oligonucleotides with a 
length of approximately 10-50 bp, preferably 15-40 bp, for example of the conserved or 
other regions, which can be determined in the manner with which the skilled worker is 

30 familiar by comparisons with other related genes. However, longer fragments of the 
nucleic acids according to the invention with a length of 100-500 bp. or the complete 
sequences, may also be used for hybridization. Depending on the nucleic 
acid/oligonucleotide used, the length of the fragment or the complete sequence, or de- 
pending on which type of nucleic acid, i.e. DNA or RNA, is being used for the hybridiza- 

35 tion, these standard conditions vary. Thus, for example, the melting temperatures for 
DNA'DNA hybnds are approximately 10°C lower than those of DNARNA hybrids of the 
same length. 



Standard hybridization conditions are to be understood as meaning, depending on the 
nucleic acid, for example temperatures of between 42 and 58oC in an aqueous buffer 
solution with a concentration of between 0.1 and 5 x SSC (1 X SSC ■ 0.15 M NaCI 15 
mM sodium citrate, pH 7.2) or additionally in the presence of 50% formamide. such as. 
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for example. 42°C in 5 x SSC t 50% formamide. The hybridization conditions for 
DNA:DNA hybrids are advantageously 0.1 x SSC and temperatures of between ap- 
proximately 20°C and 65°C, preferably between approximately 30°C and 45°C. In the 
case of DNA:RNA hybrids, the hybridization conditions are advantageously 0.1 x SSC 
5 and temperatures of between approximately 30 °C and 65 °C, preferably between ap- 
proximately 45°C and 55 °C. These hybridization temperatures which have been stated 
are melting temperature values which have been calculated by way of example for a 
nucleic acid with a length of approx. 1 00 nucleotides and a G + C content of 50% in the 
absence of formamide. The experimental conditions for DNA hybridization are de- 

10 scribed in relevant textbooks of genetics such as, for example, fn Sambrook et aL, "Mo- 
lecular Cloning", Cold Spring Harbor Laboratory, 1989, and can be calculated using 
formulae with which the skilled worker is familiar, for example as a function of the 
length of nucleic acids, the type of the hybrids or the G + C content The skilled worker 
will find further information on hybridization in the following textbooks: Ausubel et aL 

1 5 (eds), 1985, "Current Protocols in Molecular Biology". John Wiley & Sons, New York; 
Hames and Higgins (eds.), 1985, "Nucleic Acids Hybridization: A Practical Approach". 
IRL Press at Oxford University Press, Oxford; Brown (ed,). 1991, Essential Molecular 
Biology: A Practical Approach, IRL Press at Oxford University Press, Oxford. 

20 Afunctional equivalent of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, 
SEQ ID NO:9, SEQ ID NO:11 f SEQ ID NO:3, SEQ ID NO:13, SEQ ID NO:15 or SEQ 
ID NO: 17 can be furthermore defined by the degree of homology or identity with SEQ 
ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 ? SEQ ID NO;9, SEQ ID NO:11 s 
SEQ ID NO:3. SEQ ID NO:13, SEQ ID NO:15 or SEQ ID NO:17, respectively, and can 

25 furthermore comprise also natural or artificial mutations of the aforementioned nucleic 
acid sequences which encode a polypeptide with the activity of a nuclear encoded Clp- 
protease. 

The present invention also encompasses, for example, those nucleotide sequences 
30 which are obtained by modification of the SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5. 
SEQ ID NO:7 f SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:3, SEQ ID NO:13, SEQ ID 
NO:15orSEQIDNO:17. 

For example, such modifications can be generated by techniques with which the skilled 
35 worker is familiar, such as "Site Directed Mutagenesis", "Error Prone PCR", W DNA- 
shuffling" (Nature 370, 1994, pp.389-391) or "Staggered Extension Process" (Nature 
Biotechnol. 16, 1998, pp.258-261). The aim of such a modification can be, for example, 
the insertion of further cleavage sites for restriction enzymes, the removal of DNA in 
order to truncate the sequence, the substitution of nucleotides to optimize the codons, 
40 or the addition of further sequences. Proteins which are encoded via modified nucleic 
acid sequences must retain the desired function despite a deviating nucleic acid se- 
quence. 
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The term "functional equivalents" can also relate to the amino acid sequence encoded 
by the nucleic acid sequence in question. In this case, the term "functional equivalent" 
describes a protein whose amino acid sequence has a defined percentage of identity or 
5 homology with SEQ ID NO:3. 

Functional equivalents thus also comprise naturally occurring variants of the herein- 
described sequences and artificial nucleic acid sequences, for example those which 
have been obtained by chemical synthesis and which are adapted to the codon usage, 
10 and also the amino acid sequences derived from them. 

"Genetic control sequence" describes sequences which have an effect on the transcrip- 
tion and. if appropriate, translation of the nucleic acids according to the invention in 
prokaryotic or eukaryotic organisms. Examples thereof are promoters, terminators or 

15 what are known as "enhancer sequences. In addition to these control sequences, or 
instead of these sequences, the natural regulation of these sequences may still be pre- 
sent before the actual structural genes and may, if appropriate, have been genetically 
modified in such a way that the natural regulation has been switched off and the ex- 
pression of the target gene has been modified, that is to say increased or reduced. The 

20 choice of the control sequence depends on the host organism or starting organism. 
Genetic control sequences furthermore also comprise the 5 -untranslated region, in- 
trons or the noncoding 3 -region of genes. Control sequences are furthermore under- 
stood as meaning those which make possible homologous recombination or insertion 
into the genome of a host organism or which permit removal from the genome. Genetic 

25 control sequences also comprise further promoters, promoter elements or minimal 

promoters, and sequences which have an effect on the chromatin structure (for exam- 
ple matrix attachment regions (MARs)), which can modify the expression-governing 
properties, Thus, genetic control sequences may bring about for example the additional 
dependence of the tissue-specific expression on certain stress factors. Such elements 

30 have been described, for example, for water stress, abscisic acid (Lam E and Chua 
NH, J Biol Chem 1991; 266(26); 17131 -17135), high- and low-temperature stress 
(Plant Cell 1994, (6): 251-264) and heat stress (Molecular & General Genetics, 1989, 
217(2-3): 246-53). 

35 "Homology 11 between two nucleic acid sequences or polypeptide sequences is defined 
by the identity of the nucleic acid sequence/polypeptide sequence over in each case 
the entire sequence length, which is calculated by alignment with the aid of the pro- 
gram algorithm GAP according to Needleman and Wunsch 1970, J. MoL Biol. 43; 443- 
453) setting the following parameters for polypeptides: 

40 

Gap Weight: 8 Length Weight: 2 

Average Match: 2,912 Average Mismatch:-2,003 
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and the following parameters for nucleic acids: 

Gap Weight: 50 Length Weight: 3 
5 Average Match: 10.000 Average Mismatch: 0.000 

In the following text, the term identity is also used synonymously with the term "homol- 
ogy". 

10 "Mutations" of nucleic or amino acid sequences comprise substitutions, additions, dele- 
tions, inversions or insertions of one or more nucleotide residues, which may also bring 
about changes in the corresponding amino acid sequence of the target protein by sub- 
stitution, insertion or deletion of one or more amino acids, although the functional prop- 
erties of the target proteins are, overall, essentially retained. 

15 

"Natural genetic environment" means the natural chromosomal locus in the organism of 
origin. In the case of a genomic library, the natural genetic environment of the nucleic 
acid sequence is preferably retained at least in part. The environment flanks the nucleic 
add sequence at least at the 5 - or 3-side and has a sequence length of at least 50 bp, 
20 preferably at least 100 bp, especially preferably at least 500 bp, very especially pref- 
erably at least 1000 bp, and most preferably at least 5000 bp. 

"Plants'' for the purposes of the Invention are plant cells, plant tissues, plant organs, or 
intact plants, such as seeds, tubers, flowers, pollen, fruits, seedlings, roots, leaves, 
25 stems or other plant parts. Moreover, the term plants is understood as meaning propa- 
gation material such as seeds, fruits, seedlings, slips, tubers, cuttings or root stocks. 

"Recombinant DNA" describes a combination of DNA sequences which can be gener- 
ated by recombinant DNA technology. 

30 

"Recombinant DNA technology": generally known techniques for fusing DNA se- 
quences (for example described in Sambrook et aL, 1989, Cold Spring Harbor, NY, 
Cold Spring Harbor Laboratory Press), 

35 "Replication origins* 1 ensure the multiplication of the expression cassettes or vectors 
according to the invention in microorganisms and yeasts, for example the pBR322 ori 
or the P15A ori in E. coli (Sambrook et aL: "Molecular Cloning. A Laboratory Manual*, 
2nd ed. Cold Spring Harbor Laboratory Press. Cold Spring Harbor, NY, 1989) and the 
ARS1 ori in yeast (Nucleic Acids Research, 2000, 28(1 0): 2060-2063). 

40 

"Reporter genes" encode readily quantifiable proteins. The transformation efficacy or 
the expression site or timing can be assessed by means of these genes via growth 
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assay, fluorescence assay, chemoluminescence assay, bioluminescence assay or re- 
sistance assay or via a photometric measurement (intrinsic color) or enzyme activity. 
Very especially preferred in this context are reporter proteins (Schenborn E, Groskreutz 
D. Mol Biotechnol. 1999; 13(1):29-44) such as the "green fluorescent protein* (GFP) 
5 (Gerdes HH and Kaether C ( FEES Lett. 1 996; 389(1 );44-47; Chui WL et al., Curr Biol 
1996, 6:325-330; Leffel SM et al. v Biotechniques. 23(5):912-8, 1997), chloramphenicol 
acetyl transferase, a luciferase (Giacomin, Plant Sci 1996, 116:59-72; Scikantha, J 
Bact 1996, 178:121; Millar et al., Plant Mol Biol Rep 1992 10:324-414), and luciferase 
genes, in general o-galactosidase or D-glucuronidase (Jefferson et al., EMBO J. 1987, 
10 6, 3901-3907) or the Ura3 gene. 

"Selection markers" confer resistance to antibiotics or other toxic compounds: exam- 
ples which may be mentioned in this context are the neomycin phosphotransferase 
gene, which confers resistance to the aminoglycoside antibiotics neomycin (G 418), 

15 kanamydn, paromycin (Deshayes A et al, EMBO J. 4 (1 985) 2731-2737), the sul gene, 
which encodes a mutated dihydropteroate synthase (Guerineau F et al. Plant Mol Biol. 
1990; 15(1): 127-1 36), the hygromycin B phosphotransferase gene (Gen Bank Acces- 
sion NO: K 01193) and the shble resistance gene, which confers resistance to the 
bleomycin antibiotics such as zeocin. Further examples of selection marker genes are 

20 genes which confer resistance to 2-deoxyglucose-6-phosphate (WO 98/45456) or 

phosphinothridn and the like, or those which confer a resistance to antimetabolites, for 
example the dhfrgene (Reiss, Plant Physiol. (Life Sci. Adv.) 13 (1994) 142-149). Ex- 
amples of other genes which are suitable are trpB or hisD (Hartman SC and Mulligan 
RC, Proc Natt Acad Sci U S A. 85 (1988) 8047-8051)- Another suitable gene is the 

25 mannose phosphate isomerase gene (WO 94/20627) P the ODC (ornithine decarboxy- 
lase) gene (McConlogue, 1987 in: Current Communications in Molecular Biology, Cold 
Spring Harbor Laboratory, Ed.) or the Aspergillus terreus deaminase (Tamura K et al., 
Biosd Biotechnol Biochem. 59 (1995) 2336-2338). 

30 Transformation" describes a process for introducing heterologous DNA into a pro- or 
eukaryotic cell The term transformed cell describes not only the product of the trans- 
formation process per se, but also all of the transgenic progeny of the transgenic 
organism generated by the transformation. 

35 Target/target protein M ; a polypeptide encoded via the nucleic acid sequence according 
to the invention (this term is defined herein below), which may take the form of an en- 
zyme in the traditional sense or, for example, of a structural protein, a protein relevant 
for developmental processes, regulatory protein such as transcription factors, kinases, 
phosphatases, receptors, channel subunits, transport proteins, regulatory subuntts 

40 which confer substrate or activity regulation to an enzyme complex. All of the targets or 
sites of action share the characteristic that their functional presence is essential for 
survival or normal development and growth. 
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"Transgenic" : referring to a nucleic acid sequence, an expression cassette or a vector 
comprising a nucleic acid sequence according to the invention or an organism trans- 
formed with the abovementioned nucleic acid sequence, expression cassette or vector, 
5 the term transgenic describes all those constructs which have been generated by ge- 
netic engineering methods in which either the nucleic acid sequence of the target pro- 
tein or a genetic control sequence linked operably to the nucleic acid sequence of the 
target protein or a combination of the abovementioned possibilities are not in their natu- 
ral genetic environment or have been modified by recombinant methods. In this con- 
10 text, the modification can be achieved, for example, by mutating one or more nucleo- 
tide residues of the nucleic acid sequence in question. 

Intracellular protein degradation and its regulation are important for several processes 
like recycling of aminoacids, prevention of protein agglomeration and regulation of sig- 
15 naling processes (e.g. signalling of phytohormones). Cytosolic Proteins that are to be 
degraded are ubiquitinylated at the N-terminus and delivered to the proteasome which 
is established as a complex of a large number of protein components in eucaroytes. 
Roughly 12% of the genes in Arabidopsis thaliana are encoding proteins envolved in 
protein degradation by the ubiquitin pathway. 

20 

The stroma of ptant chloroplasts contains a unique ubiquitin-independent, ATP- 
dependent protease consisting of two mayor components, a serine-type protease 
(CIpP) and an ATPase (CIpC, ~D, -X) both of which are encoded by enzyme families in 
Arabidopsis thaliana (for details on the differing nomenclatures in literature see Adam 

25 et aL 2001 , Plant Physiology 125, pp.1912-18>. Six unique CIpP isoforms (ClpP1-6) are 
nuclear encoded in Arabidopsis and at least one CIpP ist encoded in the plastid ge- 
nome (pCIpP) all of which carry the three conserved active site aminoacids characteris- 
tic^ for a catalytic triade of serine proteases. Further three nuclear encoded CIpP- 
Isoforms which miss the conserved amino acid residues of the catalytic triade are 

30 found in Arabidopsis (ClpR1 ? ClpR3 t ClpR4). The catalytic activity of CIpR-type CJpP- 
Isoforms has not been shown so far. 

At least one CIpP and two CIpX proteins may be targeted to mitochondria In Arabidop- 
sis as deduced from N-terminal signal sequences. CIpP Proteases are conserved in 

35 bacteria. The CIpP protease in E.coli was formerly known as "protease Ti". A knock out 
of the protease Ti was shown to be not lethal. E.coli CipP is assembled as a complex 
of 14 CIpP subunits in two heptameric rings, Co-immunoprecipitatron suggests com- 
plexes of similar sizes and an ATP-dependet interaction of CIpP and CIpC subunits in 
Chloroplasts of Arabidopsis thaliana (Halperin etaf. 2001, Planta 213, pp. 614-619). 

40 Furthermore, a 350kDa CIpP complex has been identified in Arabidopsis chloroplasts 
using blue native gel electrophoresis. The complex presumingly containins most of the 
known CIpP Isoenzymes (Benoit-Peltier et aL 2001, Journal of Biological Chemistry 
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276. pp. 16348-16327). Consequently the complexity and redundancy of plant Clp pro- 
teases is high and detailed information about composition of the clp complex and the 
functional role of its subunits remain to be clarified. Particularly the role of CIpP redun- 
dancy is still unclear. 

5 

The CIpP subunit is capable of activly hydrolysing peptides of max. 5 aminoacids in 
vitro. ClpA,B.C subunits constitute ATP-hydrolysing chaperones which unfold target- 
proteins and present them for hydrolysis to CIpP (Porankiewicz et al. 1999, Molecular 
Microbiology 32. 449^58). Involvement of CIpP in the degradation of the cytochrome 
1 0 b6f complex an PSII has been decribed in Chlamydomonas (Majeran et al. 2001 Plant 
Physiology 23+, pp. 421^433). Functional properties of CIpR-type Clp-Proteases as 
well as the CIpP like Proeases are yet to be determined. 

Surprisingly, it has been found within the scope of the present invention that plants in 
which a Clp protease was reduced in a selective manner have phenotypes which are 
15 comparable with phenotypes generated by herbicide application. Drastic growth retar- 
dation and damage such as were observed. 

The present invention relates to the use of a polypeptide, which has the activity of nu- 
clear encoded Clp-protease in a method for identifying herbicides, preferably of a poly- 
20 peptide, which has the activity of nuclear encoded Clp-protease, which is 

a) selected from the group consisting of ClpP1 -protease, ClpP2-protease. ClpP3- 
protease, ClpP4-protease and apP6-protease; or 

25 b) selected from the group consisting of ClpR1 -protease, ClpR3-protease. ClpR4- 
protease; or 

c) ClpP-Iike-protease, wherein more preferably 

30 a) the ClpP1 -protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:1,or 

35 M > a nuc, e'C acid sequence which, owing to the degeneracy of the genetic 

code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:2 by back translating, or 

Hi) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 
40 whicn nas an identity with SEQ ID NO: 1 of has at least 50%; or 
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iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:1 f which is encoded by an amino add sequence that has at least an 
identity of 50% with the SEQ ID NO:2; 

5 b) the ClpP2-protease encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:3, or 

1 0 ») a nucleic acid sequence which, owing to the degeneracy of the genetic 

code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:4 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:3 
1 5 which has an identity with SEQ ID NO:3 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:3, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:4; 

20 

c) the ClpP3-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:5, or 

25 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:6 by back translating, or 

30 iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:5 

which has an identity with SEQ ID NO:5 of has at least 50%;or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:5, which is encoded by an amino acid sequence that has at least an 
35 Identity of 50% with the SEQ ID NO:6; 

d) the ClpP4~protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
40 NO:7, or 
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ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:8 by back translating, or 

5 Hi) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:7 

which has an identity with SEQ ID NO:7 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:7, which is encoded by an amino acid sequence that has at least an 
1 0 identity of 50% with the SEQ ID NO:8; 

e) the ClpPe-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
15 NO:9, or 



ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:10 by back translating, or 

20 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:9 
which has an identify with SEQ ID NO:9 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
25 NO:9, which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:10; 

f) the CIpRl-protease is encoded by a nucleic acid sequence which comprises: 

30 ') a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:11,or 



ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

35 NO: 1 2 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 1 
which has an identity with SEQ ID NO:1 1 of has at least 50%; or 

40 iv ) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO:1 1, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 12; 
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g) the ClpR3-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:13, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:14 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 3 
which has an identity with SEQ ID NO:13 of has at least 50%; or 



iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
15 NO:1 3, which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:14; 

h) the ClpR4-prqtease is encoded by a nucleic acid sequence which comprises: 

20 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:15. or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

25 NO:1 6 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 15 
which has an identity with SEQ ID NO:15 of has at least 50%; or 

30 iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO:15, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:16; 



i) the CIpP like-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:17, or 



ii) a nucleic add sequence which, owing to the degeneracy of the genetic 
40 code, can be deduced from the amino acid sequence shown in SEQ ID 

NO:18 by back translating, or 



20:6S:£T COOZ'ZT'ZO 6X0 
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a functional equivalent of nucleic acid sequence shown in SEQ ID NO:17 
which has an identity with SEQ ID NO: 17 of has at least 50%; or 



iv) 



35 



40 



a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO: 17, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ id NO: 18; 

wherein the sequences b) i-iv. e) Hv, f) Wv and are especially preferred 
The term -comprising" in relation to a nucleic acid sequence means that the nucleic 
acid sequence can be flanked by additional nucleic acid sequences that have on the 5' 

Itt^tZnH * °?K n 5 ' Gnd ° r °" * he 3 ' end on end a Se «" ence 'ength of 
at least 1000 bp, preferably at least 500 bp, more preferably at least 250bp, most pref- 
erably at least lOObp. " p 

The functional equivalent according to the invention of SEQ ID NO:1 as described in a) 
...). which encodes a polypeptide, which has the activity of nuclear encoded Clp- 

^™ and ^ fSaSt an identity ° f 50% « 51% - 52% > 53 %. 54%. 55%, 56%. 57% 
58%. 59%, 60%. 61%. 62%. 63%. 64% or 65% or preferably of 66%. 67%. 68%. 69% 

l£ I™ ?2% ' 73% * 74% ' 75% ' ?6% ' 77% ' 78% or 79% "*» P^'y of 800/! 
llol' f f ' % ' M% ' 85% ' 85% ' 87% ' 83% ' 89% or 90% »™»t preferably of 91%. 
92%, 93%. 94%. 95%. 96%. 97%, 98% or 99% with SEQ ID NO:1 

The functional equivalents of the nucleic acid sequence SEQ ID NO:1 set forth in a) iv 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
So; P T«t aS l!? d 3t ,e3St an ld6ntlty ° f 50% ' 51% ' 52% ' 53%. 54%, 55%, 56%. 
fiQot ™ t ™ 61% ' 62% ' 63% ' 64% ° r 65% ° r Preferab,y ° f 66% ' 67 %. 68%. 

2?" 2?' 71%< 72%> 73% ' 74% ' ?5% ' 76% " 77% ' 78% or 79 % ™e preferably of 
60%. 81%. 82%. 83%. 84%. 85%. 66%, 87%, 88%. 89% or 90% most preferably of 
91 /o, 92%. 93%. 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:2. 

The functional equivalent according to the invention of SEQ ID NO:3 as described in b) 
111), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 

22^' and haS 3t ,SaSt ^ identity ° f 50% > 51 % > 52% - 53% - 54 %- 55%. 56%, 57% 
™ ' 12' ? % * 61 %> 62% ' 63% ' 64% ° r 65% ° r P referab, V of 6 ^%. 67%. 68%, 69%'. 

17: ■ r?r°' 73% - 74% ' 75% ' 76% - 78% «■ 79% ««• p-*-** of 6 0 %. 

olt f?" J 3 I°' M%! 85% ' 86% ' 87% ' 88% - B9% or 90 % ^st preferably of 91%. 
92%, 93%. 94%, 95%, 96%, 97%. 98% or 99% with SEQ ID NO:3. 

An example of a functional equivalent of SEQ ID NO: 3 is the nucleic acid sequence of 
Arab.dops.s thaliana (Gene Bank Ace. No. AB022327). This sequence is herein incor- 
porated by reference. 



ex eoos-^T-^o ozo 
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The functional equivalents of the nucleic acid sequence set forth SEQ ID NO:3 in b) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clo-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%. 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
5 69%, 70%. 71 %, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 63%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% With SEQ ID NO:4. 

The functional equivalent according to the invention of SEQ ID NO:5 as described in c) 
10 iii). which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%. 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%. 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%. 77%, 78% or 79% more preferably of 80%. 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
15 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:5. 

The functional equivalents of the nucleic acid sequence SEQ ID NO:5 set forth in c) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
20 57%, 58%. 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%, 67%. 68%. 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%. 83%, 84%, 85%. 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:6. 

25 The functional equivalent according to the invention of SEQ ID NO:7 as described in d) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%. 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%. 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%. 

30 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%. 95%, 96%, 97%, 98% or 99% with SEQ ID NO:7. 

The functional equivalents of the nucleic acid sequence set forth SEQ ID NO:7 in d) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
35 Clp-protease and has at least an identity of 50%, 51 %, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%. 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%. 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%. 94%. 95%, 96%, 97%, 98% or 99% With SEQ ID NO;7. 

40 

The functional equivalent according to the invention of SEQ ID NO:9 as described in e) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
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protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%. 60%, 61%. 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%. 
81%, 82%, 83%. 84%, 85%. 86%, 87%, 88%, 89% or 90% most preferably of 91%. 
5 92%, 93%, 94%. 95%, 96%, 97%, 98% or 99% With SEQ ID NO:9. 

The functional equivalents of the nucleic acid sequence set forth SEQ ID NO:9 In e) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
10 57%, 58%. 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%, 67%. 68%, 
69%, 70%, 71%, 72%. 73%, 74%, 75%, 76%, 77%, 76% or 79% more preferably of 
80%, 81%. 82%, 83%. 84%, 85%, 86%, 87%. 88%, 89% or 90% most preferably of 
91%. 92%, 93%, 94%. 95%, 96%, 97%, 98% or 99% with SEQ ID NO:10. 

15 The functional equivalent according to the invention of SEQ ID NO:1 1 as described in f) 
iri), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%. 67%. 68%. 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 

20 81%, 82%, 83%, 84%, 85%, 86%, 87%. 88%, 89% or 90% most preferably of 91%. 
92%. 93%, 94%. 95%, 96%, 97%. 98% or 99% with SEQ ID NO:11. 

The functional equivalents of the nucleic acid sequence SEQ ID NO:11 set forth in f) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
25 Clp-protease and has at least an identity of 50%. 51%. 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%. 67%. 68%. 
69%, 70%. 71%, 72%. 73%, 74%, 75%, 76%. 77%. 78% or 79% more preferably of 
80%, 81%. 82%, 83%, 84%, 85%, 86%, 87%. 88%. 89% or 90% most preferably of 
91%, 92%, 93%. 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO: 12, 

30 

An example of a functional equivalent of SEQ ID NO: 11 is the nucleic acid sequence 
of Arabldopsis thaliana (Gene Bank Acc. No. AB022330). This sequence is herein in- 
corporated by reference, 

35 The functional equivalent according to the invention of SEQ ID NO: 13 as described in 
g) iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%. 53%, 54%. 55%. 56%. 57%. 
58%. 59%, 60%, 61%. 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%. 74%, 75%, 76%. 77%. 78% or 79% more preferably of 80%. 

40 81 %, 82%, 83%, 84%, 85%, 86%, 87%. 88%, 89% or 90% most preferably of 91 %. 
92%, 93%. 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:13. 
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The functional equivalents of the nucleic acid sequence SEQ ID NO: 13 set forth in g) 
iv. are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an Identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%. 60%, 61%. 62%. 63%, 64% or 65% or preferably of 66%, 67%. 68%. 
5 69%, 70%, 71%. 72%, 73%. 74%. 75%, 76%. 77%. 76% or 79% more preferably of 
80%. 81%. 82%. 83%, 84%. 85%, 86%. 87%, 88%. 89% or 90% most preferably of 
91%, 92%, 93%. 94%, 95%, 96%. 97%, 98% or 99% with SEQ ID NO: 14. 

The functional equivalent according to the invention of SEQ ID NO:15 as described in 
10 h) iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%. 52%, 53%, 54%, 55%. 56%, 57%. 
58%. 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%. 69%. 
70%, 71%. 72%, 73%, 74%, 75%, 76%. 77%, 78% or 79% more preferably of 80%. 
81%, 82%, 83%. 84%, 85%, 86%, 87%. 88%, 89% or 90% most preferably of 91%, 
15 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:1 5. 

The functional equivalents of the nucleic acid sequence SEQ ID NO: 15 set forth in h) 
iv. are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%. 55%. 56%. 
20 57%. 58%, 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%. 67%, 68%, 
69%. 70%. 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%. 83%. 84%, 85%, 86%. 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID N0:16. 

25 The functional equivalent according to the invention of SEQ ID NO:1 7 as described in i) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%. 76%, 77%, 78% or 79% more preferably of 80%, 

30 81 %. 82%, 83%, 84%. 85%, 86%. 87%. 88%, 89% or 90% most preferably of 91 %, 
92%, 93%, 94%. 95%. 96%, 97%. 98% or 99% with SEQ ID NO:17. 

The functional equivalents of the nucleic acid sequence SEQ ID NO: 17 set forth in i) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
35 Clp-protease and has at least an identity of 50%, 51 %, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91 %, 92%, 93%. 94%, 95%, 96%. 97%, 98% or 99% with SEQ ID NO:1 8. 

40 
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An example of a functional equivalent of SEQ ID NO: 17 is the nucleic acid sequence 
of Arabidopsis ihaliana (Gene Bank Acc, No. AK1 18525). This sequence is herein in- 
corporated by reference. 

5 Furthermore claimed within the scope of the present invention are plant nucleic acid 
sequence 



I) encoding a CIpP2-protease comprising: 



10 a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:3, or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino add sequence shown in SEQ ID 

1 5 NO:4 by backtranslating, or 

c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 
which has an identity with SEQ ID NO:3 of has at least 66%; or 

20 d) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NQ:1 1, which is encoded by an amino add sequence that has at least an 
identity of 76% with the SEQ ID NO:4; 



25 



35 



II) encoding a ClpR1 -protease comprising: 

a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NQ.H.or 



b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
30 code, can be deduced from the amino add sequence shown in SEQ ID 

NO: 12 by backtransiating, or 



c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 
which has an identity with SEQ ID NO:1 1 of has at least 69%; or 

d) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:11 ( which is encoded by an amino acid sequence that has at least an 
identity of 71% with the SEQ ID NO:12; 



40 HI) encoding a ClpP-like-protease comprising: 
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a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:17, or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
5 code, can be deduced from the amino acid sequence shown in SEQ ID 

NO:18 by backtranslating. or 

c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 
which has an identity with SEQ ID NO: 17 of has at least 67%; or 

1 0 d) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO:17, which is encoded by an amino acid sequence that has at least an 
identity of 79% with the SEQ ID NO:18; 

The functional equivalent of SEQ ID NO:3 set forth in I c) has at least an identity of 
1 5 66%. 67%, 68%. 69%. 70%, 71 %, 72%, 73%. 74%, by preference at least 75%. 76%, 
77%, 78%, 79%, 80%, 81%, 82% or 83%. preferably at least 84%, 85%. 66%, 87%, 
88%, 89%, 90%, 91%, 92% or 93%, especially preferably at least 94%, 95%, 96%, 
97%. 98% or 99% with SEQ ID NO:3. 

The functional equivalents of the nucleic acid sequence SEQ ID NO:3 set forth in I) d) 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 77%, by preference at least 78%, 79%, 
80%, 81%, 62% or 83%, preferably at least 84%, 85%. 86%, 87%, 88%, 89%, 90%. 
91%. 92%, 93%, especially preferably at Jeast 94%, 95%, 96%, 97%, 98%. 99% with 
SEQ ID NO:4. 

The functional equivalent of SEQ ID NO:l 1 set forth in II c) has at least an identity of 
69%, 70%, 71%, 72%, 73% or 74%, by preference at least 75%, 76%, 77%, 78%, 79%, 
80%. 81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%. 89%. 90%, 
30 91 %. 92% or 93%, especially preferably at least 94%, 95%, 96%, 97%, 98% or 99% 
withSEQIDNO:11. 

The functional equivalents of the nucleic acid sequence SEQ ID NO:1 1 set forth in II) d) 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
35 Clp-protease and has at least an identity of 71 % by preference at least 72%, 73%,74%, 
75%. 76%, 77%. 78%, 79%. 80%, 81%, 82%, 83%, preferably at least 84%, 85%, 86%, 
87%, 88%, 89%. 90%. 91%, 92%, 93%, especially preferably at least 94%, 95%, 96%, 
97%, 98%, 99% With SEQ ID NO:12. 

40 The functional equivalent of SEQ ID NO:17 set forth in I c) has at least an identity of 
67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, by preference at least 75%, 76%, 77%, 
78%, 79%. 80%. 81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%, 



20 



25 
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89%, 90%, 91%, 92% or 93% r especially preferably at least 94%, 95%, 96%, 97%, 
98% or 99% with SEQ ID NO: 17. 



The functional equivalents of the nucleic acid sequence SEQ ID NO: 17 set forth in I) d) 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 79%, by preference at least 79%, 80%. 
81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 
92%, 93%, especially preferably at least 94%, 95%. 96%, 97%, 98%, 99% with SEQ ID 
NO: 18. 

The polypeptides encoded by the abovementioned nucleic acid sequences according 
to I c)-d). II c)-d) and lit c)-d) are likewise claimed. The functional equivalents as de- 
scribed in c) and d) are distinguished by the same functionality, i.e. they have the activ- 
ity of a clp-protease. 

The nucleic acid sequences I c)-d), II c)-d> and III c>d) are hereinbelow termed NCLP- 
sequences. 



The term "nucleic acid sequences according to the invention" which is used hereinbe- 
20 low refers to nucleic acid sequences encoding a polypeptide, which has the activity of 
nuclear encoded Clp-protease in a method for identifying herbicides, preferably of a 
polypeptide, which has the activity of nuclear encoded Clp-protease, which is 

a) selected from the group consisting of ClpP1 -protease, ClpP2-protease, ClpP3- 
25 protease, ClpP4-protease and ClpP6-protease; or 

b) selected from the group consisting of CIpRl -protease, ClpR3-protease. ClpR4- 
protease; or 



30 c) ClpP-like-protease, wherein more preferably 

a) the ClpP1 -protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
35 NO:1.or 



ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:2 by back translating, or 

40 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:i 
which has an identity with SEQ ID NO:1 of has at least 50%; or 
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20 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:1, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:2; 

b) the ClpP2-protease encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:3, or 



30 



35 



a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ id 
NO:4 by back translating, or 



1 5 Hi) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:3 

which has an identity with SEQ ID NO:3 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:3 ' w™ 0 " ls encoded by an amino acid sequence that has at least an 
20 identity of 50% with the SEQ ID NO:4; 

c) the ClpP3-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
25 NO:5, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:6 by back translating, or 



iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NOS 
which has an identity with SEQ ID NO:5 of has at least 50%;or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:5, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:6; 



d) the ClpP4-protease is encoded by a nuc/eic acid sequence which 



comprises: 



40 



i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:7, or 
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H) 



a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:8 by back translating, or 



Hi) a functional equivalent of nucleic acid sequence shown in SEQ JD NO:7 
which has an identity with SEQ ID NO:7 of has at least 50%; < 



or 



IV) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:7 - whicn js encoded by an amino acid sequence that has at least 
1 0 identity of 50% with the SEQ ID NO:8; 



an 



e) the Clppe-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
15 NO:9, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:10 by back translating, or 

20 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:9 
which has an identity with SEQ id NO:9 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
25 NO:9 « which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:1 0; 

f) the ClpR1-protease is encoded by a nucleic acid sequence which comprises: 

30 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:11.or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

35 NO:1 2 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 1 
which has an identity with SEQ ID NO:1 1 of has at least 50%; or 

40 iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO:l 1. which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:12; 
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10 



35 



g) the ClpR3-protease is encoded by a nucleic add sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:13, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 14 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 13 
which has an identity with SEQ ID NO:13 of has at least 50%; or 



iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
15 NO: 13. which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:14; 

h) the ClpR4-protease is encoded by a nucleic acid sequence which comprises: 

20 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:15 f or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

25 NO:16 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO;1 5 
which has an identity with SEQ ID NO:1 5 of has at least 50%; or 

30 iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO:15, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:16; 



i) the CIpP like-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:17, or 



ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
40 code, can be deduced from the amino acid sequence shown in SEQ ID 

NO; 18 by back translating, or 
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Hi) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 17 
which has an identity with SEQ ID NO:17 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
5 NO: 17, which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:18; 

wherein the sequences b) Mv, e) i-iv, f) i-iv and are especially preferred 

10 A polypeptide, which has the activity of nuclear encoded Clp-protease and is encoded 
by a nucleic acid sequence according to the invention are hereinbelow simply referred 
to as "CLP*. 

Reduced amounts of glyoxysomal CLP cause growth retardation and necrotic and 
1 5 chlorotlc leaves in plants. 

The gene products of the nucleic adds according to the invention constitute novel tar- 
gets for herbicides, which make possible the provision of novel herbicides for control- 
ling undesired plants. Moreover, the gene products of the nucleic acids according to 
20 the invention constitute novel targets for growth regulators which make possible the 
provision of novel growth regulators for regulating the growth of plants. 

Undesired plants are understood as meaning, in the broadest sense, all those plants 
which grow at locations where they are undesired, for example: 

25 

Dicotyledonous weeds of the genera: Sinapis, Lepidium, Galium, Stellaria, Matricaria, 
Anthernis, Galinsoga, Chenopodium, Urtica, Senecio, Amaranthus, Portulaca, Xan- 
thium. Convolvulus, Ipomoea, Polygonum, Sesbania, Ambrosia, Cirsium, Carduus, 
Sonchus, Solanum, Rorippa, Rotala, Lindernia, Lamium, Veronica, Abutilon, Ernex, 
30 Datura, Viola, Galeopsis, Papaver, Centaurea, Trifolium, Ranunculus, Taraxacum. 

Monocotyledonous weeds from the genera: Echinochloa, Setaria, Panicum, Digitaria, 
Phleum, Poa, Festuca, Eleusine, Brachiaria, Lolium, Brornus, Avena, Cyperus, Sor- 
ghum. Agropyron, Cynodon, Monochoria, Fimbristylis, Sagittaria, Eleocharis, Scirpus, 
35 Paspalum, Ischaemum, Sphenoclea, Dactyloctenium, Agrostis, Alopecurus, Apera. 

SEQ ID NO;1; 3, 5, 7, 9, 11, 13, 15, 17or 19-21 or parts of SEQ ID NO; 1; 3, 5, 7, 9, 
1 1, 13, 15, 17 or 19-21 can be used for the preparation of hybridization probes. The 
preparation of these probes and the experimental procedure is known. For example, 
40 this can be effected via the selective preparation of radioactive or nonradioactive 

probes by PGR and the use of suitably labeled oligonucleotides, followed by hybridiza- 
tion experiments. The technologies required for this purpose are detailed, for example, 
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in T Maniatis, E.F. Fritsch and J. Sambrook, "Molecular Cloning: A Laboratory Man- 
ual", Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989). The probes in 
question can furthermore be modified by standard technologies (Lit. SDM or random 
mutagenesis) in such a way that they can be employed for further purposes, for exam- 
5 pie as a probe which hybridizes specifically with mRNA and the corresponding coding 
sequences in order to analyze the corresponding sequences in other organisms. 

The abovementioned probes can be used for the detection and isolation of functional 
equivalents of SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16 or 18 from other plant species on 
10 the basis of sequence identities. In this context, part or all of the sequence of the SEQ 
ID NO:2 in question is used as a probe for screening a genomic or cDNA library of the 
plant species in question or in a computer search for sequences of functional equiva- 
lents in electronic databases. 

15 Preferred plant species are the undesired plants which have already been mentioned 
at the outset 

The invention furthermore relates to expression cassettes comprising 
20 a) genetic control sequences in operable linkage with a NCLP sequence; or 

b) additional functional elements, or 

c) a combination of a) and b); 
and to the use of expression cassettes comprising 

a) genetic control sequences in operable linkage with a nucleic acid sequence ac- 
cording to the invention, 

b) additional functional elements, or 

c) a combination of a) and b); 

35 for expressing a CLP, which can be used in in vitro assay systems. Both embodiments 
of the above-described expression cassettes are referred in the following text as ex- 
pression cassette according to the invention. 

in a preferred embodiment, an expression cassette according to the invention com- 
40 prises a promoter at the 5' end of the coding sequence and, at the 3' end, a transcrip- 
tion termination signal and, if appropriate, further genetic control sequences which are 
linked operably with the interposed nucleic acid sequence according to the invention. 
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The expression cassettes according to the invention are also understood as meaning 
analogs which can be brought about, for example, by a combination of the individual 
nucleic acid sequences on a polynucleotide (multiple constructs), on a plurality of 
5 polynucleotides in a cell (^transformation) or by sequential transformation. 

Advantageous genetic control sequences under point a) for the expression cassettes 
according to the invention or for vectors comprising expression cassettes according to 
the invention are, for example, promoters such as the cos, tac, trp. tet, Ipp, lac, laclq, 
1 0 T7, T5, T3, gal, trc, ara, SP6, O-PR or the d-PL promoter, all of which can be used for 
expressing a CLP, in Gram-negative bacterial strains. 

Examples of further advantageous genetic control sequences are present, for example, 
in the promoters amy and SP02, both of which can be used for expressing a CLP. in 

1 5 Gram-positive bacterial strains, and in the yeast or fungal promoters AUG1, GPD-1 . 
PX6, TEF, CUP1, PGK, GAP1. TPI, PHOS, AOX1, GAL10/CYC1, CYC1. OliC, ADH, 
TDH, Kex2, MFA or NMT or combinations of the abovementioned promoters (Degryse 
etal., Yeast 1995 June 15; 11(7):629-40; Romanos etal. Yeast 1992 June;8(6):423- 
88; Benito et al. Eur. J. Plant Pathol. 104. 207-220 (1998); Cregg et al. Biotechnology 

20 (N Y) 1993 Aug;11(8):905-10; Luo X, Gene 1995 Sep 22;163(1):127-31: Nacken et al 
Gene 1996 Oct 10;175(1-2): 253-60; Turgeon etal., Mol Cell Biol 1987 Sep;7(9):3297-' 
305) or the transcription terminators NMT, Gcy1 , TrpC, AOX1. nos, PGK or CYC1 (De- 
gryse et aL, Yeast 1995 June 15; 1 1 (7):629-40; Brunelli et al. Yeast 1993 (Dec9(12): 
1309-18; Frisch et aL, Plant Mol. Biol. 27(2), 405^09 (1995); Scorer et al., Biotechnol- 

25 ogy (N.Y. 12 (2). 181-184 (1994), Genbank acc. number Z46232; Zhao et al. Genbank 
acc number : AF049064; Punt et aL, (1987) Gene 56 (1 ), 1 17-124), all of which can be 
used for expressing CLP, in yeast strains. 

Examples of genetic control sequences which are suitable for expression in insect cells 
30 are the polyhedrin promoter and the p1 0 promoter (Luckow, VA. and Summers M D 
(1 988) Bio/Techn. 6, 47-55). 

Advantageous genetic control sequences for expressing CLP. in cell culture, in addition 
to polyadenylation sequences such as, for example, from simian virus 40, are eu- 
35 karyotrc promoters of viral origin such as. for example, promoters of the polyoma virus, 
adenovirus 2. cytomegalovirus or simian virus 40. 

Further advantageous genetic control sequences for expressing nuclear encoded CIp 
Protease, in plants are present in the plant promoters CaMV/35S [Franck et al Cell 
40 21(1980) 285-294], PRP1 [Ward et al., Plant. Mol. Biol. 22 (1993)], SSU, OCS.LEB4, 
USP, STLS1 , B33, NOS; FBPaseP (WO 98/1 8940) or in the ubiquitin or phase'olin 
promoter; a promoter which is preferably used being, in particular, a plant promoter or 
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a promoter derived from a plant virus. Especially preferred are promoters of viral origin 
such as the promoter of the cauliflower mosaic virus 35S transcript (Franck et al. f Cell 
21 (1980), 285-294; Odell et aL, Nature 313 (19B5), 810-812). Further preferred consti- 
tutive promoters are, for example, the agrobacterium nopaline synthase promoter, the 
5 TR double promoter, the agrobacterium OCS (octopine synthase) promoter, the ubiq- 
uitin promoter, (Holtorf S et aL, Plant Mol Biol 1995, 29:637-649), the promoters of the 
vacuolar ATPase subunits, or the promoter of a praline-rich wheat protein (WO 
91/13991), 

1 0 The expression cassettes may also comprise, as genetic control sequence, a chemi- 
cally inducible promoter, by which the expression of the exogenous gene in the plant 
can be controlled at a specific point in time. Such promoters, such as, for example, the 
PRP1 promoter (Ward et aL, Plant. Mol. Biol, 22 (1993), 361-366), a salicylic-acid- 
inducible promoter (WO 95/19443), a benzenesulfonamide-inducible promoter (EP-A- 

1 5 03881 86), a tetracyclin-inducible promoter (Gate et aL, (1 992) Plant J. 2, 397404), an 
abscisio-acid-inducible promoter (EP-A 335528) or an ethanol- or cyclohexanone- 
inducible promoter (WO 93/21334) may also be used. 

Furthermore, suitable promoters are those which confer tissue- or organ-specific ex- 

20 pression in. for example, anthers, ovaries, flowers and floral organs, leaves, stomata, 
trichomes, stems, vascular tissues, roots and seeds. Others which are suitable in addi- 
tion to the abovementioned constitutive promoters are, in particular, those promoters 
which ensure leaf-specific expression. Promoters which must be mentioned are the 
potato cytosolic FBPase promoter (WO 97/05900), the rubisco (ribulose-1,5- 

25 bisphosphate carboxylase) SSU (small subunit) promoter or the ST-LSI promoter from 
potato (Stockhaus et aL, EMBO J. 8 (1989), 2445 - 245). Promoters which are further- 
more preferred are those which control expression in seeds and plant embryos. Exam- 
ples of seed-specific promoters are the phaseolin promoter (US 5,504,200, Bustos MM 
et aL, Plant CelL 1 989; 1 (9):839-53), the promoter of the 2S albumin gene (Joseffson 

30 LG et aL, J Biol Chem 1987, 262:12196-12201), the legumin promoter (Shirsai A et aL, 
Mol Gen Genet 1989;215(2):326-331), the USP (unknown seed protein) promoter 
(Baumlein H et aL, Molecular & General Genetics 1991, 225(3):459-67) 1 the napin 
gene promoter (Stalberg K, et aL, L Planta 1996, 199:515-519), the sucrose binding 
protein promoter (WO 00/26388) or the LeB4 promoter (Baumlein H et aL, Mol Gen 

35 Genet 1991, 225: 121-128; Fiedler. U. et aL, Biotechnology (NY) (1995), 13 (10) 1090). 

Further promoters which are suitable as genetic control sequences are, for example, 
specific promoters for tubers, storage roots or roots, such as, for example, the class I 
patatin promoter (B33), the potato cathepsin D inhibitor promoter, the starch synthase 
40 (GBSS1 ) promoter or the sporamin promoter, fruit-specific promoters such as, for ex- 
ample, the fruit-specific promoter from tomato (EP-A 409625), fruit-maturation-specific 
promoters such as, for example, the fruit-maturation-specific promoter from tomato 
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(WO 94/21794), inflorescence-specific promoters such as, for example, the phytoene 
synthase promoter (WO 92/16635) or the promoter of the P-rr gene (WO 98/22593). or 
plastid- or chromoplast-specific promoters such as, for example, the RNA polymerase 
promoter (WO 97/06250), or else the Glycine max phosphoribosyl-pyrophosphate ami- 
5 dotransferase promoter (see also Genbank Accession No. U87999), or another node- 
specific promoter as described in EP-A 249676. 

Additional functional elements b) are understood as meaning, by way of example but 
not by limitation, reporter genes, replication origins, selection markers and what are 

10 known as affinity tags, in fusion with CLP, directly or by means of a linker optionally 

comprising a protease cleavage site. Further suitable additional functional elements are 
sequences which ensure that the product is targeted into the apoplasts, into plastids, 
the vacuole, the mitochondrion, the peroxisome, the endoplasmatic reticulum (ER) or, 
owing to the absence of such operative sequences, remains in the compartment where 

15 it is formed, the cytosol, (Kermode. Crit. Rev. Plant Sci. 15, 4 (1996), 285-423). 

Also in accordance with the invention are vectors comprising at least one copy of the 
nucleic acid sequences according to the invention and/or the expression cassettes ac- 
cording to the invention. 

20 

In addition to plasmids, vectors are furthermore also understood as meaning all of the 
other known vectors with which the skilled worker is familiar, such as, for example, 
phages, viruses such as SV40, CMV, baculovirus, adenovirus, transposons, IS ele- 
ments, phasmids. phagemids, cosmids or linear or circular DNA. These vectors can be 
25 replicated autonomously in the host organism or replicated chromosomally; chromo- 
somal replication is preferred. 

In a further embodiment of the vector, the nucleic acid construct according to the inven- 
tion can advantageously also be introduced into the organisms in the form of a linear 
30 DNA and integrated into the genome of the host organism via heterologous or homolo- 
gous recombination. This linear DNA may consist of a linearized plasmid or only of the 
nucleic acid construct as vector, or the nucleic acid sequences used. 

Further prokaryotic or eukaryotic expression systems are mentioned in Chapters 16 
35 and 17 in Sambrook et ah, "Molecular Cloning: A Laboratory Manual." 2nd ed., Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY, 1989. Further advantageous vectors are described in Hellens et al. (Trends in 
plant science, 5, 2000). 

40 The expression cassette according to the invention and vectors derived therefrom can 
be used for transforming bacteria, cyanobacteria, (for example of the genus Synecho- 
cystis, Anabaena, Calothrix, Scytonema, Oscillatoria. Plectonema and Nostoc), proteo- 
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bacteria such as, for example, Magnetococcus sp, MC1, yeasts, filamentous fungi and 
algae and eukaryoatic nonhuman cells (for example insect cells) with the aim of pro- 
ducing CLP. recombinant!^ the generation of a suitable expression cassette depending 
on the organism in which the gene is to be expressed, 

5 

Vectors comprising a NCLP sequence form part of the subject-matter of the present 
invention. 

In a further advantageous embodiment, the nucleic acid sequences according to the 
10 invention may also be introduced into an organism by themselves. 

If, in addition to the nucleic acid sequences, further genes are to be introduced into the 
organism, they can all be introduced into the organism together in a single vector, or 
each individual gene can be introduced into the organism in each case in one vector, it 
15 being possible to introduce the different vectors simultaneously or in succession. 

In this context, the introduction, into the organisms in question (transformation), of the 
nucleic acid(s) according to the invention, of the expression cassette or of the vector 
can be effected in principle by all methods With which the skilled worker is familiar. 

20 

In the case of microorganisms, the skilled worker will find suitable methods in the text- 
books by Sambrook, J. et ah (1989) "Molecular cloning: A laboratory manual", Cold 
Spring Harbor Laboratory Press, von F.M. Ausubel et al. (1994) "Current protocols in 
molecular biology", John Wiley and Sons, by D.M. Glover et aL, DNA Cloning Vol1 , 

25 (1995), IRL Press (ISBN 019-963476-9), by Kaiser et al. (1994) Methods in Yeast Ge- 
netics, Cold Spring Habor Laboratory Press or Guthrie et aL "Guide to Yeast Genetics 
and Molecular Biology", Methods in Enzymology, 1994, Academic Press. In the trans- 
formation of filamentous fungi, the methods of choice are firstly the generation of pro- 
toplasts and transformation with the aid of PEG (Wiebe et al. (1997) MycoL Res. 101 

30 (7): 971-877; Proctor et al. (1997) Microbiol. 143, 2538-2591), and secondly the trans- 
formation with the aid of Agrobacterium tumefaciens (de Groot et aL (1998) Nat. Bio- 
tech. 16, B39-842). 

In the case of dicots, the methods which have been described for the transformation 
35 and regeneration of plants from plant tissues or plant cells can be exploited for tran- 
sient or stable transformation. Suitable methods are the biolistic method or the trans- 
formation of protoplasts (cf., for example, Willmitzer, L, 1993 Transgenic plants. In; 
Biotechnology, A Multi-Volume Comprehensive Treatise (H.J. Rehm, G. Reed, A. 
Puhler, P. Stadler, eds.), Vol. 2, 627-659, VCH Weinheim-New York-Basle- 
40 Cambridge), electroporation, the incubation of dry embryos in DNA-containing solution, 
microinjection and the agrobacterium-radiated gene transfer. The abovementioned 
methods are described, for example, in B. Jenes et al., Techniques for Gene Transfer, 



Z0:90=H C002"2T*Z0 SCO 



BASF Akf lengesellschaft 



20030949 



PF 55117 EP 



25 



30 



35 



40 



29 

in: Transgenic Plants, Vol. 1, Engineering and Utilization, edited by S.D. Kung and R 
Wu Academic Press (1993) 128-143 and in Potrykus, Annu. Rev. Plant Physiol Plant 
Molec.Biol. 42 (1 991 ) 205-225). 

5 The transformation by means of agrobacteria, and the vectors to be used for the trans- 
formats, are known to the skilled worker and described extensively in the literature 
(Bevan et al., NucL Acids Res. 12 (1984) 8711. The intermediary vectors can be inte- 
grated ,nto the agrobacterial Ti or Ri plasmid by means of homologous recombination 
owing to sequences which are homologous to sequences in the T-DNA. This plasmid 

i additionally contains the vir region, which is required for the transfer of the T-DNA In- 
termediary vectors are not capable of replication in agrobacteria. The intermediary vec- 
tor can be transferred to Agrobacterium tumefaciens by means of a helper plasmid 
(conjugation). Binary vectors are capable of replication both in E. col. and in agrobacte- 
ria They contain a selection marker gene and a linker or polylinker which are framed 
by the nght and left T-DNA border region. They can be transformed directly into the 
agrobacteria (Holsters et al. Mol. Gen. Genet. 163 (1978), 181-187) EP A 0 120 516- 
Hoekema, in: The Binary Plant Vector System Offsetdrukkerij Kanters B.V.. Alblasser- 
dam (1985). Chapter V; Fraley et al., Grit. Rev. Plant. Sol., 4: 1-46 and An et al. EMBO 
J. 4 (1985), 277-267). 

The transformation of monocots by means of vectors based on agrobacterium has also 
been described (Chan et al.. Plant Mol. Biol. 22(1993). 491-506; Hiei et al., Plant J. 6 
(1994) 271-282; Deng et al. Science in China 33 (1990), 28-34; Wilmink et al.. Plant 
Cell Reports 11,(1992)76-80; May etal. Biotechnology 13 (1995)486^92; Conner and 
Domisse; Int. J, Plant Sci. 153 (1992) 550-555; Ritchie et al. Transgenic Res. (1993) 
252-265). Alternative systems for the transformation of monocots are the transforma- 
tion by means of biofistic approach (Wan and Lemaux; Plant Physiol. 104(1994) 37. 
48; Vasil etal. Biotechnology 1 1 (1992), 667-674; Ritala etal., Plant Mol. Biol 24' 
(1994) 317-325; Spencer et al.. Theor. Appl. Genet. 79 (1990), 625-631), protoplast 
^!!!? ati0n ' thS e,ectr °P oratto " of partially permeabilized cells, and the introduction 
of DNA by means of glass fibers. In particular the transformation of maize has been 
descnbed repeatedly in the literature (cf.. for example, WO 95/06128; EP 0513849 AT 
EP 0465875 A1 ; EP 0292435 A1 ; Fromm et al., Biotechnology 8 (1 990) 833-844- ' 
Gordon-Kamm et al., Plant Cell 2 (1990), 603-618; Koziel et al.. Biotechnology ' 
11(1993) 194-200; Moroc etal., Theor Applied Genetics 80 (190) 721-726). 

The successful transformation of other cereal species has also already been described 
for example in the case of barley (Wan and Lemaux, see above; Ritala et al., see 
above; wheat (Nehra et al.. Plant J. 5(1994) 285-297). 

Agrobacteria which have been transformed with a vector according to the invention can 
likewise be used in a known manner for the transformation of plants, such as test 
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plants like Arabidopsis or crop plants like cereals, maize, oats, rye, barley, wheat, soya, 
rice, cotton, sugarbeet, canola, sunflower, flax, hemp, potato, tobacco, tomato, carrot, 
capsicum, oilseed rape, tapioca, cassava, arrowroot, Tagetes, alfalfa, lettuce and the 
various tree, nut and grapevine species, for example by bathing scarified leaves or leaf 
5 segments in an agrobacterial solution and subsequently growing them in suitable me- 
dia. 

The genetically modified plant cells can be regenerated via all methods with which the 
skilled worker is familiar. Such methods can be found in the abovementioned publica- 
10 tions by S.D. Kung and R. Wu, Potrykus or HOfgen and Wiilmitzer 

The transgenic organisms generated by transformation with one of the above- 
described embodiments of an expression cassette comprising a nucleic acid sequence 
according to the invention or a vector comprising the abovementioned expression cas- 

15 sette, and the recombinant CLP, which can be obtained from the transgenic organism 
by means of expression, form part of the subject matter of the present invention. The 
use of transgenic organisms comprising an expression cassette according to the inven- 
tion, for example for providing recombinant protein, and/or the use of these organisms 
in in-vivo assay systems likewise form part of the subject matter of the present inven- 

20 tion. 

Preferred organisms for the recombinant expression are not only bacteria, yeasts, 
mosses, algae and fungi, but also eukaryotic cell lines. 

25 Preferred mosses are Physcomitrefla patens or other mosses described in Kryptoga- 
men [Cryptogamial, VoL2, Moose, Fame [Mosses, Ferns], 1991, Springer Veriag (ISBN 
3540536515). 

Preferred within the bacteria are, for example, bacteria from the genus Escherichia, 
30 Erwinia, Flavobacterium, Alcaligenes or cyanobacteria, for example from the genus 
Synechocystis, Anabaena, Calothrix, Scytonerna, Oscillatoria, Plectonema and Nostoc, 
especially preferably Synechocystis or Anabaena. 

Preferred yeasts are Candida, Saccharomyces, Schizosaccheromyces, Hansenula or 
35 Pichia. 

Preferred fungi are Aspergillus, Trichoderma, Ashbya, Neurospora, Fusarium, Beauve- 
ria, Mortierella, Saprolegnia, Pythium, or other fungi described in Indian Chem Engr. 
Section B. Vol 37, No 1,2 (1995). 

40 

Preferred plants are selected in particular among monocotyledonous crop plants such 
as, for example, cereal species such as wheat, barley, sorghum or millet, rye, triticale, 
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maize, rice or oats, and sugarcane. The transgenic plants according to the invention 
are, furthermore, in particular selected from among dicotyledonous crop plants such as, 
for example, Brassicaceae such as oilseed rape, cress, Arabidopsis, cabbages or ca- 
nola; Leguminosae such as soyabean, alfalfa, pea, beans or peanut. Solanaceae such 
5 as potato, tobacco, tomato, egg plant or capsicum; Asteraceae such as sunflower, 
Tagetes, lettuce or Calendula; Cucurbitaceae such as melon, pumpkin/squash or zuc- 
chini, or linseed, cotton, hemp, flax, red pepper, carrot, sugar beet, or various tree, nut 
and grapevine species. 

1 0 In principle, transgenic animals such as, for example, C. elegans, are also suitable as 
host organisms. 

Also preferred is the use of expression systems and vectors which are available to the 
public or commercially available. 

15 

Those which must be mentioned for use in E. coli bacteria are the typical advanta- 
geous commercially available fusion and expression vectors pGEX [Pharmacia Biotech 
Inc; Smith, D.B. and Johnson, K.S. (1988) Gene 67:31-40], pMAL (New England Bio- 
labs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ). which contains glutathione 
20 S transferase (GST), maltose binding protein or protein A. the pTrc vectors (Amann et 
al., (1988) Gene 69:301-315). "pKK233-2" from CLONTECH, Palo Alto, CA and the 
"pET, and the "pBAD" vector series from Stratagene, La Jolla and the TOPO-TA vec- 
tor series drom Invitrogen. 

25 Further advantageous vectors for use in yeast are pYepSed (Baldari, et al., (1987) 
Embo J. 6:229-234), pMFa (Kuijan and Herskowitz, (1982) Cell 30:933-943), pJRY88 
(Schultz et al., (1987) Gene 54:1 13-123), and pYES derivatives, pGAPZ derivatives, 
pPICZ derivatives, and the vectors of the "Pichia Expression Kit" (Invitrogen Corpora- 
tion, San Diego, CA). Vectors for use in filamentous fungi are described in: van den 

30 Hondel, C.A.M. J.J. & Punt, P.J. (1 991 ) "Gene transfer systems and vector develop- 
ment for filamentous fungi, in: Applied Molecular Genetics of Fungi, J.F. Peberdy, et 
al., eds., p. 1-28, Cambridge University Press: Cambridge. 

As an alternative, insect cell expression vectors may also be used advantageously, for 
35 example for expression in Sf9, Sf21 or Hi5 cells, which are infected via recombinant 
Baculoviruses. Examples of these are the vectors of the pAc series (Smith et al. (1 983) 
Mol. Cell Biol. 3:2156-2165) and the pVL series (Luckiow and Summers (1989) Virol- 
ogy 170:31-39). Others which may be mentioned are the Baculovirus expression sys- 
tems "MaxBac 2.0 Kit" and "Insect Select System* 1 from Invitrogen, Carlsbad or 
40 "BacPAK Baculovirus Expression System" from CLONTECH, Palo Alto, CA. Insect 
cells are particularly suitable for overexpressing eukaryotic proteins since they effect 
postradiational modifications of the proteins which are not possible in bacteria and 
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yeasts. The skilled worker is familiar with the handling of cultured insect cells and with 
their infection for expressing proteins, which can be carried out analogously to known 
methods (Luckow and Summers, Bio/Tech. 6, 1988, pp.47-55; Glover and Hames 
(eds) in DNA Cloning 2, A practical Approach, Expression Systems, Second Edition. 
5 Oxford University Press, 1995, 205-244). 

Plant cells or algal cells are others which can be used advantageously for expressing 
genes. Examples of plant expression vectors can be found as mentioned above in 
Becker, D., et al. (1992) "New plant binary vectors with selectable markers located 
1 0 proximal to the left border", Plant Mol. Biol. 20: 1 1 95-1 1 97 or in Bevan, M.W. (1 984) 
"Binary Agrobacterium vectors for plant transformation", Nucl. Acid. Res. 12: 871 1- 
8721. 

Moreover, the nucleic acid sequences according to the invention can be expressed in 
1 5 mammalian cells. Examples of suitable expression vectors are pCDMB and pMT2PC, 
which are mentioned in: Seed, B. (1987) Nature 329:840 or Kaufman et al. (1987) 
EMBO J. 6:187-195). Promoters preferably to be used in this context are of viral origin 
such as, for example, promoters of polyoma virus, adenovirus 2, cytomegalovirus or 
simian virus 40. Further prokaryotic and eukaryotic expression systems are mentioned 
20 in Chapter 1 6 and 17 in Sambrook et al.. Molecular Cloning: A Laboratory Manual. 2nd 
ed.. Cold Spring Harbor Laboratory. Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989. Further advantageous vectors are described in Hellens etal. 
(Trends in plant science, 5, 2000). 

25 The transgenic organisms which comprise a NCLP sequence are claimed within the 
scope of the present invention. 

All of the above-described embodiments of the transgenic organisms, which comprise 
at least one nucleic acid sequence according to the invention come under the term 
30 "transgenic organism according to the invention". 

The present invention furthermore relates to the use of CLP, in a method for identifying 
herbicidally active test compounds. 

35 The method according to the invention for identifying herbicidally active compounds 
preferably comprises the following steps: 

i. bringing CLP into contact with one or more test compounds under conditions 
which permit the test compound(s) to bind to a nucleic acid sequence according 

40 to the invention or to CLP, and 

ii. detecting whether the test compound binds to the CLP of i), or 
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in. 



5 iv. 



15 



detecting whether the test compound reduces or blocks the enzymatic or biologi- 
cal activity of CLP of i), or 

detecting whether the test compound reduces or blocks the transcription, transla- 
tion or expression of CLP of i). 



The detection in accordance with step (ii) of the above method can be effected using 
techniques which identify the interaction between the polypeptide and ligand. In this 
1 0 context, either the test compound or the enzyme can contain a detectable label such 
as, for example, a fluorescent label, a radioisotope, a chemiluminescent label or an 
enzyme label. Examples of enzyme labels are horseradish peroxidase, alkaline phos- 
phatase or luciferase. The subsequent detection depends on the label and is known to 
the skilled worker. 



In this context, five preferred embodiments which are also suitable for high-throughput 
methods (HTS) in connection with the present invention must be mentioned in particu- 
lar 



20 1 . The average diffusion rate of a fluorescent molecule as a function of the mass 
can be determined in a small sample volume via fluorescence correlation spec- 
troscopy (FCS) (Proc. Natl. Acad. Sci. USA (1994) 1 1753-1 1575). FCS can be 
employed for determining protein/ligand interactions by measuring the change in 
the mass, or the changed diffusion rate which this entails, of a test compound 

25 when binding to CLP. A method according to the invention can be designed di- 

rectly for measuring the binding of a test compound labeled by a fluorescent 
molecule. As an alternative, the method according to the invention can be de- 
signed in such a way that a chemical reference compound which is labeled by a 
fluorescent molecule is displaced by further test compounds ("displacement as- 

30 say"). 



Fluoresence polarization exploits the characteristic of a quiescent fluorophore 
excited with polarized light to likewise emit polarized light. If, however, the fluoro- 
phore is allowed to rotate during the excited state, the polarization of the fluores- 
cent light which is emitted is more or less lost. Under otherwise identical condi- 
tions (for example temperature, viscosity, solvent), the rotation is a function of 
molecule size, whereby findings regarding the size of the fluorophore-bound resi- 
due can be obtained via the reading (Methods in Enzymology 246 (1995). pp. 
283-300). A method according to the invention can be designed directly for 
measuring the binding of a test compound labeled with a fluorescent molecule to 
the CLP. As an alternative, the method according to the invention may also take 
the form of the "displacement assay" described under 1 . 
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3. Fluorescence resonance energy transfer (FRET) is based on the irradiation-free 
energy transfer between two spatially adjacent fluorescent molecules under suit- 
able conditions. A prerequisite is that the emission spectrum of the donor mole- 

5 cule overlaps with the excitation spectrum of the acceptor molecule. The fluores- 

cent label of CLP, and binding test compound, the binding can be measured by 
means of FRET (Cytometry 34, 1 998, pp. 1 59-1 79). As an alternative, the 
method according to the invention may also take the form of the "displacement 
assay" described under 1 . An especially suitable embodiment of FRET technol- 
10 ogy is "Homogeneous Time Resolved Fluorescence" (HTRF) as can be obtained 

from Packard Bioscience. 

4. Surface-enhanced laser desorption/ionfzation (SELDI) in combination with a time- 
of-flight mass spectrometer (MALDLTOF) makes possible the rapid analysis of 

15 molecules on a support and can be used for analyzing protein/ligand interactions 

(Worral et aL, (1998) Anal. Biochem. 70:750-756). In a preferred embodiment. 
CLP, is immobilized on a suitable support and incubated with the test compound. 
After one or more suitable wash steps, the test compound molecules which are 
additionally bound to CLP, can be detected by means of the abovementioned 

20 methodology and test compounds which are bound to CLP, can thus be selected. 

5. The measurement of surface plasmon resonance is based on the change in the 
refractive index at a surface when a test compound binds to a protein which is 
immobilized to said surface. Since the change in the refractive index is identical 

25 for virtually all proteins and polypeptides for a defined change in the mass con- 

centration at the surface, this method can be applied to any protein in principle 
(Lindberg et al. Sensor Actuators 4 (1983) 299-304; Malmquist Nature 361 
(1993) 186-187). The measurement can be carried out for example with the 
automatic analyzer based on surface plasmon resonance which is available from 

30 Biacore (Freiburg) at a throughput of, currently, up to 384 samples per day. A 

method according to the invention can be designed directly for measuring the 
binding of a test compound to CLP. As an alternative, the method according to 
the invention may also take the form of the "displacement assay" described under 
1. 

35 

The compounds identified via the abovementioned methods 1 to 5 may be suitable as 
inhibitors- All of the substances identified via the abovementioned methods can subse- 
quently be checked for their herbicidal action in another embodiment of the method 
according to the invention. 

40 

Furthermore, there exists the possibility of detecting further candidates for herbicidal 
active ingredients by molecular modeling via elucidation of the three-dimensional struc- 
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ture of CLP, by x-ray structure analysis. The preparation of protein crystafs required for 
x-ray structure analysis, and the relevant measurements and subsequent evaluations 
of these measurements, the detection of a binding site in the protein, and the prediction 
of potential inhibitor structures are known to the skilled worker In principle, an optimS- 
5 zation of the compound identified by the abovementioned methods is also possible via 
molecular modeling. 

A preferred embodiment of the method according to the invention, which is based on 
steps i) and ii) ? consists in selecting a test compound which reduces or blocks the activ- 
10 ity of the CLP. Preferably, the activity of the CLP, incubated with the test compound is 
herein compared with the activity of a CLP, not incubated with a test compound. 

A more preferred embodiment of the method based on steps i) and ii) consists in 

15 I expressing CLP in a transgenic organism according to the invention or growing 
an organism which naturally contains a CLP, 

ii. bringing CLP, of step i) in the cell digest of the transgenic or nontransgenic or- 
ganism, in partially purified or in homogeneously purified form, into contact with a 
20 test compound; and 

Hi. selecting a compound which reduces or blocks the activity of the nuclear en- 
coded Clp Protease- Preferably the activity of CLP incubated with the test com- 
pound is herein compared with the activity of a CLP. not incubated with a test 
25 compound. 

The solution containing the CLP, can consist of the lysate of the original organism or of 
the transgenic organism which has been transformed with an expression cassette ac- 
cording to the invention. If necessary, the CLP, can be purified partially or fully via cus- 
30 tomary methods. A general overview over current protein purification techniques is de- 
scribed, for example, in Ausubel, F.M. et al M Current Protocols in Molecular Biology, 
Greene Publishing Assoc. and Wiley-lnterscience (1994); ISBN 0-87969-309-6. In the 
case of recombinant preparation, the protein which has been fused with an affinity tag 
can be purified via affinity chromatography as is known to the skilled worker. 

35 

The CLP, which is required for in vitro methods can thus be isolated either by means of 
heterologous expression from a transgenic organism according to the invention or from 
an organism containing CLP, for example from an undesired plant, the term "undesired 
plant" being understood as meaning the species mentioned at the outset. 

40 

To identify herbicidal compounds, the CLP, is now incubated with a test compound. 
After a reaction time, the enzymatic activity of the CLP, incubated with the test com- 
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pound is determined in comparison with a CLP, not incubated with a test compound. Jf 
the CLP, is inhibited, a significant decrease in activity in comparison with the activity of 
the noninhibited polypeptide according to the invention is observed, the result being a 
reduction of at feast 10%, advantageously at least 20%, preferably at least 30%, espe- 
5 cially preferably by at least 50%, up to 100% reduction (blocking). Preferred is an inhi- 
bition of at least 50% attest compound concentrations of 10^M, preferably at lO^M, 
especially preferably of lO^M, based on enzyme concentration in the micromolar 
range. 

10 The enzymatic activity of CLP, can be determined for example by an activity assay in 
which the increase of the product, the decrease of the substrate (or starting material) or 
the decrease or increase of the cofactor are determined, or by a combination of at least 
two of the abovementioned parameters, as a function of a defined period of time. 

15 Examples of suitable substrates are, for example small peptides and modified small 

peptides like peptides coupled to a fluorogenic molecule such as aminomethylcoumartn 
and succinylated peptides. 

If appropriate, derivatives of the abovementioned compounds which contain a detect- 
20 able label such as, for example, a fluorescent label (e.g. fluorogenic substrates such as 
N-Suc-Leu-Tyr-(7-amino-4-methylcoumarine) (SLT-AMC), SuoAla-Ala-AJa-AMC. Sue- 
Leu-Leu-Val-Tyr-AMC, Suc-Ala-Ala-Phe-AMC, Suc-lle-lle-Trp-AMC, Suo-Ala-Phe-Lys- 
AMC), a radioisotope label or a chemiluminescent label, may also be used. 

25 The amounts of substrates to be employed in the activity tests may range between 0.5 
and 100 mM # based on 1-100 Mg/mJ enzyme. 
[ 

The activity can be determined for example by tracking Proteolysis fluorimetrically 
when using fluorogenic Peptide substrates analogously to the method described by 
30 Woo et al. 1 989 The Journal of Biological Chemistry 264, pp.208&-2091 , which is 
herein incorporated by reference. 

The activity may also be determined in an ATP-dependent fashion in the presence of 
ClpA, ClpB orClpC Protein as described in Halperin etal. 2001, Planta 213, pp. 614- 
35 619. The preferred Substrate is then b- casein. 

Furthermore the activity may be measured by HPLC and HPLC-MS mehtods detecting 
fragments of the peptides used as substrates. 

40 Another preferred embodiment of the method according to the invention which is based 
on steps i) and iii) consists of the following steps: 
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generating a transgenic organism according to the invention comprising a nucleic 
add sequence according to the invention, wherein CLP is expressed recombi- 
nantly; 

applying a test compound to the transgenic organism of i) and to a nontransgenic 
organism of the same species; 

determining the growth or the viability of the transgenic and the nontransgenic 
organisms after application of the test substance, and 

selecting test compounds which bring about a reduced growth or a limited viabil- 
ity of the nontransgenic organism in comparison with the growth of the transgenic 
organism. 

15 In this context, the difference in growth in step iv) for the selection of a herbicidally ac- 
tive inhibitor amounts to at least 10%, by preference 20%, preferably 30%, especially 
preferably 40% and very especially preferably 50%, 

The transgenic organism in this context is preferably a plant, an alga, a cyanobacte- 

20 rium, for example of the genus Synechocystis or a proteobacterium such as, for exam- 
ple, Magnetococcus sp. MC1 , preferably plants which can be transformed by means of 
customary techniques, such as Arabidopsis thaliana Allium cepa, Ananas comosus, 
Arachis hypogaea, Asparagus officinalis, Beta vulgaris spec, altissima, Beta vulgaris 
spec, rapa, Brassica napus var. napus, Brassica napus var. napobrassica, Brassica 

25 rapa var silvestris, Camellia sinensis, Carthamus tinctorius, Carya illinotnensis. Citrus 
limon. Citrus sinensis. Coffea arabica (Coffea canephora, Coffea liberica), Cucumis 
sativus, Cynodon dactylon, Daucus carota, Elaeis guineensis, Fragaria vesca, Glycine 
max, Gossypium hirsutum, (Gossypium arboreum, Gossypium herbaceum, Gossypium 
vitifolium), Helianthus annuus, Hevea braslliensis, Hordeum vulgare, Humulus lupulus, 

30 Ipomoea batatas, Juglans regia, Lens culinaris, Linum usitatissimum, Lycopersicon 

lycopersicum, Malus spec, Manihot esculenta, Medicago sativa, Musa spec. Nicotiana 
tabacum (N.rustica), Olea europaea, Oryza sativa, Phaseolus lunatus, Phaseolus vul- 
garis, Picea abies, Pinus spec, Pisum sativum, Prunus avium, Prunus pereica, Pyrus 
communis, Ribes sylvestre. Ricinus communis, Saccharum officinarum, Secale ce- 

35 reale, Solanum tuberosum, Songhum bicolor (s. vulgare), Theobroma cacao, Trifolium 
pratense, Triticum aestivum, Triticum durum, Vicia faba, Vitis vinlfera, Zea mays, or 
cyanobacteria which can be transformed readily, such as Synechocystis, into which the 
sequence encoding a polypeptide according to the invention has been incorporated by 
transformation. These transgenic organisms thus show increased tolerance to com- 

40 pounds which inhibit the polypeptide according to the invention. "Knock-out" mutants in 
which the analogous CLProtease gene which is naturally present in this organism has 
been selectively switched off may also be used. 
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However, the abovementioned embodiment of the method according to the invention 
can also be used for identifying substances with a growth-regulatory action. In this con- 
text, the transgenic organism employed is a plant. The method for identifying sub- 
5 stances with growth-regulatory activity thus comprises the following steps: 

i. generating a transgenic plant comprising a nucleic acid sequence according to 
the invention encoding CLP f wherein CLP is expressed recombinantly; 

10 ii. applying a test substance to the transgenic plant of i) and to a nontransgenic 
ptant of the same variety, 

iii. determining the growth or the viability of the transgenic plant and the nontrans- 
genic plant after application of the test compound, and 

15 

iv. selecting test substances which bring about a reduced growth of the nontrans- 
genic plant in comparison with the growth of the transgenic plant 

Here, step iv) involves the selection of test compounds which bring about a modified 
20 growth of the nontransgenic organism in comparison with the growth of the transgenic 
organism. Modified growth is understood as meaning, in this context, inhibition of the 
vegetative growth of the plants, which can manifest itself in particular in reduced longi- 
tudinal growth. Accordingly, the treated plants show stunted growth; moreover, their 
leaves are darker in color. In addition, modified growth is also understood as meaning 
25 a change in the course of maturation over time, the inhibition or promotion of lateral 

branched growth of the plants, shortened or extended developmental stages, increased 
standing ability, the growth of larger amounts of buds, flowers, leaves, fruits, seed ker- 
nels, roots and tubers, an increased sugar content in plants such as sugarbeet, sugar 
cane and citrus fruit, an increased protein content in plants such as cereals or soybean, 
30 or stimulation of the latex flow in rubber trees. The skilled worker is familiar with the 
detection of such modified growth. 

It is also possible, in the method according to the invention, to employ a plurality of test 
compounds in a method according to the invention. If a group of test compounds affect 

35 the target, then it is either possible directly to isolate the individual test compounds or 
to divide the group of test compounds into a variety of subgroups, for example when it 
consists of a multiplicity of different components, in order to thus reduce the number of 
the different test compounds in the method according to the invention. The method 
according to the invention is then repeated with the individual test compound or the 

40 relevant subgroup of test compounds. Depending on the complexity of the sample, the 
above-described steps can be earned out repeatedly, preferably until the subgroup 
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identified in accordance with the method according to the invention only comprises a 
small number of test compounds, or indeed just one test compound. 

All of the above-described methods for identifying inhibitors with herbicidai or growth- 
5 regulatory activity are hereinbelow referred to as "methods according to the invention". 

All of the compounds which have been identified via the methods according to the in- 
vention can subsequently be tested in vivo for their herbicidai and growth-regulatory 
activity. One possibility of testing the compounds for herbicidai action is to use duck- 
10 weed, Lemna minor, in microliter plates. Parameters which can be measured are 
changes in the chlorophyll content and the photosynthesis rate. It is also possible to 
apply the compound directly to undesired plants, it being possible to identify the herbi- 
cidai action for example via restricted growth. 

1 5 The method according to the invention can advantageously also be carried out in high- 
throughput methods, known as HTS, which makes possible the simultaneous testing of 
a multiplicity of different compounds. 

The use of supports which contain one or more of the nucleic acid molecules according 
20 to the invention, one or more of the vectors containing the nucleic acid sequence ac- 
cording to the invention, one or more transgenic organisms containing at least one of 
the nucleic acid sequences according to the invention or one or more (poly)peptides 
encoded via the nucleic acid sequences according to the invention lends itself to carry- 
ing out UTS in practice. 

25 

Supports which contain one or more of the NCLP sequences, one or more of the vec- 
tors comprising the NCLP sequences one or more transgenic organisms containing at 
least one NCLP sequences or one or more (polypeptides encoded by the NCLP se- 
quences are part of the present invention. 

30 

The support used can be solid or liquid, but is preferably solid and especially preferably 
a microtiter plate. The abovementioned supports also form part of the subject matter of 
the present invention. In accordance with the most widely used technique, 96-weli, 
384-well and 1536-well microtiter plates which, as a rule, can comprise volumes of 200 
35 □!, are used. Besides the microtiter plates, the further components of an HTS system 
which match the corresponding microtiter plates, such as a large number of instru- 
ments, materials, automatic pipetting devices, robots, automated plate readers and 
plate washers, are commercially available. 

40 In addition to the HTS systems based on microtiter plates, what are known as Tree- 
format assays" or assay systems where no physical barriers exist between the sam- 
ples, as described, for example, in Jayaickreme etah, Proc. Natl. Acad. Sci U.S.A. 19 
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(1994) 161418; Chelsky, "Strategies for Screening Combinatorial Libraries", First An- 
nual Conference of The Society for Biomolecular Screening in Philadelphia, Pa. (Nov. 
710, 1995); Salmon et al„ Molecular Diversity 2 (1996), 5763 and US 5,976,813, may 
also be used. 

5 

The invention furthermore relates to herbicidally active compounds identified by the 
methods according to the invention. These compounds are hereinbelow referred to as 
"selected compounds". They have a molecular weight of less than 1000 g/mol, advan- 
tageously less than 500 g/mol. preferably less than 400 g/moi, especially preferably 
10 less than 300 g/mol. Herbicidally active compounds have a Ki value of less than 1 mM, 
preferably less than 1 uM, especially preferably less than 0.1 uM, very especially pref- 
erably less than 0.01 uM. 

The invention furthermore relates to compounds with growth-regulatory activity identi- 
15 fled by the methods according to the invention. These compounds too are hereinbelow 
referred to as "selected compounds". 

Naturally, the selected compounds can also be present in the form of their agriculturally 
useful salts. Agriculturally useful salts which are suitable are mainly the salts of those 
20 cations, or the acid addition salts of those acids, whose cations, or anions, do not ad- 
versely affect the herbicidal action of the herbicidally active compounds identified via 
the methods according to the invention. 

If the selected compounds contain asymmetrically substituted D-carbon atoms, they 
25 may furthermore also be present in the form of racemates. enantlomer mixtures, pure 
enantiomers or. if they have chiral substituents, also In the form of diastereomer mix- 
tures. 

The selected compounds can be chemically synthesized substances or substances 
30 produced by microbes and can be found, for example, in cell extracts of, for example, 
plants, animals or microorganisms. The reaction mixture can be a cell-free extract or 
comprise a cell or cell culture. Suitable methods are known to the skilled worker and 
are described generally for example in Alberts, Molecular Biology the cell. 3rd Edition 
(1994), for example chapter 17. The selected compounds may also originate from 
35 comprehensive substance libraries. 

Candidate test compounds can be expression libraries such as, for example. cDNA 
expression libraries, peptides, proteins, nucleic acids, antibodies, small organic sub- 
stances, hormones. PNAs or the like (Milner, Nature Medicin 1 (1995), 879-880; Hupp. 
40 Cell. 63 (1995), 237-245; Gibbs. Cell. 79 (1994), 193-198 and references cited 
therein). 
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The selected compounds can be used for controlling undesired vegetation and/or as 
growth regulators. Herbicidal compositions comprising the selected compounds afford 
very good control of vegetation on noncrop areas. In crops such as wheat, rice, maize, 
soybean and cotton, they act against broad-leaved weeds and grass weeds without 
5 inflicting any significant damage on the crop plants. This effect is observed in particular 
at low application rates. The selected compounds can be used for controlling the harm- 
ful plants which have already been mentioned above. 

Depending on the application method in question, selected compounds, or herbicidal 
10 compositions comprising them, can advantageously also be employed in a further 

number of crop plants for eliminating undesired plants. Examples of suitable crops are: 

Allium cepa, Ananas comosus, Arachis hypogaea, Asparagus officinalis, Beta vulgaris 
spec, altissima, Beta vulgaris spec, rapa, Brassica napus var. napus, Brassica napus 

15 var. napobrassica, Brassica rapa var. silvestris. Camellia sinensis, Carthamus tincto- 
rius, Carya illinoinensis, Citrus limon, Citrus sinensis, Coffea arabica (Coffea can- 
ephora, Coffea liberica), Cucumis sativus, Cynodon dactylon, Oaucus carota, Elaeis 
gufneensis, Fragaria vesca, Glycine max, Gossypium hirsutum, (Gossypium arboreurn, 
Gossypium herbaceum, Gossypium vitifolium), Helianthus annuus, Hevea brasiliensis, 

20 Hordeum vulgare, Humulus lupulus, Ipomoea batatas, Juglans regia, Lens culinaris, 
Linum usitatissimum, Lycopersicon lycopersicum, Malus spec, Manihot esculenta, 
Medicago satfva, Musa spec, Nicotiana tabacum (N.rustica), Olea europaea, Oryza 
sativa, Phaseolus lunatus, Phaseolus vulgaris, Picea abies, Pinus spec, Pisum sati- 
vum, Prunus avium, Prunus persica, Pyrus communis, Ribes sylvestre, Ricinus com- 

25 munis, Saccharum officinarum, Secale cereale, Solanum tuberosum, Sorghum bicolor 
<s. vulgare), Theobroma cacao, Trifolium pratense, Triticum aestivum, Triticum durum, 
Vicia faba, Vitis vinifera, Zea mays. 

In addition, the selected compounds can also be used in crops which tolerate the ao 
30 tion of herbicides owing to breeding, including recombinant methods. The generation of 
such crops is described hereinbelow. 

The invention furthermore relates to a method of preparing the herbicidal or growth- 
regulatory composition which has already been mentioned above, which comprises 
35 formulating selected compounds with suitable auxiliaries to give crop protection prod- 
ucts. 

The selected compounds can be formulated for example in the form of directly spray- 
able aqueous solutions, powders, suspensions, also highly concentrated aqueous, oily 
40 or other suspensions or suspoemulsions or dispersions, ernuisifiabte concentrates, 
emulsions, oil dispersions, pastes, dusts, materials for spreading or granules, and ap- 
plied by means of spraying, atomizing, dusting, spreading or pouring. The use forms 
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depend on the intended use and the nature of the selected compounds; in any case, 
they should guarantee the finest possible distribution of the selected compounds. The 
herbicidal compositions comprise a herbicidally active amount of at least one selected 
compound and auxiliaries conventionally used in the formulation of herbicidal composi- 
5 tions. 

For the preparation of emulsions, pastes or aqueous or oily formulations and dispersi- 
b!e concentrates (DC), the selected compounds can be dissolved or dispersed in an oil 
or solvent, it being possible to add further formulation auxiliaries for homogenization. 

10 However, it is also possible to prepare liquid or solid concentrates from selected com- 
pound, if appropriate solvents or oil and, optionally, further auxiliaries comprising liquid 
or solid concentrates, and these concentrates are suitable for dilution with water. The 
following can be mentioned: emulsifiable concentrates (EC, EW), suspensions (SC), 
soluble concentrates (SL), dispersible concentrates (DC), pastes, pills, wettable pow- 

1 5 ders or granules, it being possible for the solid formulations either to be soluble or dis- 
persible (wettable) in water. In addition, suitable powders or granules or tablets can 
additionally be provided with a solid coating which prevents abrasion or premature re- 
lease of the active ingredient. 

20 In principle, the term "auxiliaries" is understood as meaning the following classes of 
compounds: antifoams, thickeners, wetting agents, tackifiers, dispersants, emulsifiers, 
bactericides and/or thixotropic agents. The skilled worker is familiar with the meaning of 
the abovementioned agents. 

25 SLs, EWs and ECs can be prepared by simply mixing the ingredients in question; pow- 
ders can be prepared by mixing or grinding in specific types of mills (for example ham- 
mer mills). DCs, SCs and SEs are usually prepared by wet milling, it being possible to 
prepare an SE from an SC by addition of an organic phase which may comprise further 
auxiliaries or selected compounds. The preparation is known. Powders, materials for 

30 spreading and dusts can advantageously be prepared by mixing or cog rinding the ac- 
tive substances together with a solid carrier. Granules, for example coated granules, 
impregnated granules and homogeneous granules, can be prepared by binding the 
selected compounds to solid carriers. The skilled worker is familiar with further details 
regarding their preparation, which are mentioned for example in the following publica- 

35 tions: US 3,060,084, EP-A 707445 (for liquid concentrates), Browning, "Agglomera- 
tion", Chemical Engineering, Dea 4, 1967, 147-48, Perry's Chemical Engineer's Hand- 
book, 4th Ed., McGraw-Hill, New York, 1963, pages 8-57 and et seq. WO 91/13546, 
US 4,172.714, US 4,144,050, US 3,920,442, US 5,180,587, US 5,232,701 , US 
5,208,030, GB 2,095,558, US 3,299,566, Klingman, Weed Control as a Science, John 

40 Wiley and Sons, Inc., New York, 1961, Hance etal. t Weed Control Handbook, 8th Ed,, 
Blackwell Scientific Publications, Oxford, 1989 and Mollet, R r Grubemann, A., Formu- 
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lation technology, Wiley VCH Verfag GmbH. Weinheim (Federal Republic of Germany) 
2001 ' 



40 



The skilled worker is familiar with a multiplicity of inert liquid and/or solid carriers which 
> are suitable for the formulations according to the invention, such as, for example, liquid 
additives such as mineral oil fractions of medium to high boiling point such as kerosene 
or diesel oil, furthermore coal tar oils and oils of vegetable or animal origin, aliphatic 
cyclic and aromatic hydrocarbons, for example paraffin, tetrahydrophthalene, alkylated 
naphthalenes or their derivatives, alkylated benzenes or their derivatives, alcohols such 
as methanol, ethanol, propanol, butanol and cyclohexanol, ketones such as cyclohexa- 
none, or strongly polar solvents, for example amines such as N-methylpyrrolidone or 
water. 

Examples of solid carriers are mineral earths such as silicas, silica gels, silicates, talc 
kaolin, limestone, lime, chalk, bole, loess, clay, dolomite, diatomaceous earth, calcium 
sulfate, magnesium sulfate, magnesium oxide, ground synthetic materials, fertilizers 
such as ammonium sulfate, ammonium phosphate, ammonium nitrate, ureas and 
products of vegetable origin such as cereal meal, tree bark meal, wood meal and nut- 
shell meal, cellulose powders or other solid earners. 

The skilled worker is familiar with the multiplicity of surface-active substances (surfac- 
tants) which are suitable for the formulations according to the invention such as, for 
example, alkali metal salts, alkaline earth metal salts or ammonium salts of aromatic 
sulfonic acids for example lignosulfonic acid, phenolsulfonic acid, naphthalenesulfonic 
acid, and dibutylnaphthalenesulfonic acid, and of fatty acids, of alkyl- and alkylarylsuf- 
fonates. of alkyl sulfates, lauryl ether sulfates and fatty alcohol sulfates, and salts of 
sulfated hexa-, hepta- and octadecanols and of fatty alcohol glycol ethers, condensates 
of sulfonated naphthalene and its derivatives with formaldehyde, condensates of naph- 
thalene or of the naphthalenesulfonic acids with phenol and formaldehyde, poly- 
oxyethylene octylpheno! ether, ethoxylated isooctyl-, octyl- or nonylphenol. alkylphenyl 
polyglycol ethers, tributylphenyl polyglycol ether, alkylaryl polyether alcohols, isotridecyl 
alcohol, fatty alcohol/ethylene oxide condensates, ethoxylated caster oil, polyoxyethyh- 
ene alkyl ethers or polyoxypropylene alkyl ethers, lauryl alcohol polyglycol ether ace- 
tate, sorbitol esters, lignosulfite waste liquors or methylcellulose. 

The herbicidal compositions, or the selected compounds, can be applied pre- or post- 
emergence. If the selected compounds are less well tolerated by certain crop plants 
application techniques may be used in which the selected compounds are sprayed 
with the aid of the spraying apparatus, in such a way that they come into as little con- 
tact, if any, with the leaves of the sensitive crop plants while the selected compounds 
reach the leaves of undesired plants which grow underneath, or the bare soil surface 
(post-directed, lay-by). 
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Depending on the intended purpose of the control measures, the season, the target 
plants and the growth stage, the application rates of selected compounds amount to 
0.001 to 3.0. preferably 0.01 to 1.0 kg/ha. 

5 

The invention is illustrated in greater detail by the examples which follow, which are not 
to be considered as limiting. 

General DNA manipulation and cloning methods 

10 

Cloning methods such as, for example, restriction cleavages, agarose gel electropho- 
reses. purification of DNA fragments, transfer of nucleic acids to nitrocellulose and ny- 
lon membranes, linking DNA fragments, transformation of Escherichia coli cells, grow- 
ing bacterium and sequence analyses of recombinant DNA were carried out as de- 
1 5 scribed by Sambrook et al. (1969) (Cold Spring Harbor Laboratory Press: ISBN 0- 

87969-309-6) and Ausubel, P.M. et al., Current Protocols in Molecular Biology, Greene 
Publishing Assoc. and Wiley-lnterscience (1994); ISBN 0-87969-309-6. 

Molecular-biological standard methods for plants and plant transformation methods are 
20 described in Schultz et al., Plant Molecular Biology Manual, Kluwer Academic Publish- 
ers (1998). Reither et al., Methods in Arabidopsis Research, World scientific press 
(1992) and Arabidopsis: A Laboratory Manual (2001), ISBN 0-87969-573-0. 

The bacterial strains used hereinbelow (E. coli DH5, XL-1 blue) were obtained from 
25 Stratagene, BRL Gibco or Invitrogen. Carlsberg, CA The vectors used for cloning were 
pUC 19 from Amersham Pharmacia (Freiburg) and the vector pBinAR (Hofgen and 
Willmitzer. Plant Science 66, 1990, 221-230). 

Example 1: Generation of a cDNA library in the plant transformation vector 

30 

To generate a cDNA library (hereinbelow termed "binary cDNA library") in a vector 
which can be used directly for transforming plants, mRNA was isolated from a variety 
of plant tissues and transcribed into double-stranded cDNA using the cDNA Synthese 
Kit (Amersham Pharmacia Biotech, Freiburg). The cDNA first-strand synthesis was 

35 carried out using T1 2-1 8 oligonucleotides following the manufacturer's instructions. 
After size fractionation and the ligation of EcoRI-Notl adapters following the manufac- 
turer's instructions and filling up the overhangs with Pfu DNA polymerase (Stratagene), 
the cDNA population was normalized. The method of Kohci et al, 1995, Plant Journal 8, 
771-776 was followed, the cDNA being amplified by PCR with the oligonucleotide N1 

40 under the conditions given in Table 1 . 

Table 1 
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Time [sec] 


Number of cycles 


y4 


300 


1 




8 


10 




60 




72 


180 




94 


8 


10 


50 


60 




72 


180 




94 


8 


10 


48 


60 




72 


180 




72 


420 


1 



The resulting PCR product was bound to the column matrix of the PCR purification kit 
5 (Qiagen, Hilden) and eluted with 300 mM NaP buffer, pH 7.0, 0.5 mM EDTA, 0.04% 
SDS. The DNA was denatured for 5 minutes fn a boiling water bath and subsequently 
renatured for 24 hours at 60oC. 50ul of the DNA were applied to a hydroxylapatite col- 
umn and the column was washed 3 times with 1 ml of 10 mM NaP buffer, pH 6.8. The 
bound single-stranded DNA was eluted with 130 mM NaP buffer, pH 6.8, precipitated 
1 0 with ethanol and dissolved in 40 pi of water. 20 pi of growth were used for a further 
PCR amplification as described above. After further ssDNA concentration, a third PCR 
ampJiftcation was carried out as described above. 

The plant transformation vector for taking up the cDNA population which had been 
15 generated as described above was generated via restriction enzyme cleavage of the 
vector pUC18 with Sbfl and BamHI, purification of the vector fragment followed by fill- 
ing up the overhangs with Pfu DNA polymerase and relegation with T4 DNA ligase 
(Stratagene). The resulting construct is hereinbelow termed pUC1 8Sbf K 

20 The vector pBinAR was first cleaved with Notl, the ends were filled up and the vector 
was relegated, cleaved with Sbfl, the ends were filled up and the vector was relegated 
and subsequently cleaved with EcoRI and HindHI. The resulting fragment was ligated 
into a derivative of the binary plant transformation vector pPZP (Hajdukiewicz,P, Svab, 
2, Maliga. P., (1994) Plant Mol Biol 25:989-994) which makes possible the transforma- 
tion of plants by means of agrobacterium and mediates kanamycin resistance in trans- 
genic plants. The construct generated thus is hereinbelow termed pSun12/35S. 

pUC18Sbfl- was used as template in a polymerase chain reaction (PCR) with the oli- 
gonucleotides V1 and V2 (see Table 2) and Pfu DNA polymerase. The resulting frag- 
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ment was ligated into the Smal-cut pSunl2/35S, giving rise to pSunb!ues2. Following 
cleavage with Notl, dephosphorylation with shrimp alkaline phosphatase (Roche Diag- 
nostics. Mannheim) and purification of the vector fragment, pSunblues2 was ligated 
with the normalized, likewise Notl-cut cDNA population. Following transformation into 
5 E.coli XMblue (Stratagene), the resulting clones were deposited into microtiter plates. 
The binary cDNA library contains cDNAs in "sense"- and in "antisense" orientation un- 
der the control of the cauliflower mosaic virus 35S promoter, and, after transformation 
into tobacco plants, these cDNAs can, accordingly, lead to "cosuppression" and "an- 
tisense" effects. 

10 

Table 2: Oligonucleotides used 



Oligonucleotide 


Nucleic acid sequence 


N1 


5-AGAATTCGCGGCCGCT-3' (SEQ ID NO:23) 


V1 (PWL93not) 


5'-CTCATGCGGCCGCGCGCAACGCAA7TAATGTG-3' (SEQ 
ID NO:24) 


V2 (pWL92) 


5'-TCATGCGGCCGCGAGATCCAGTTCGATGTAAC-3' (SEQ 
ID NO:25) 


G1 (35S) 


5'-GTGGATTGATGTGATATCTCG"3' (SEQ ID NO:26) 


G2 (OCS) 


5-GTAAGGATCTGAGCT ACACAT-3' (SEQ ID NO:27) 



15 Example 2: Transformation and analysis of tobacco plants 

Selected clones of the binary cDNA library were transformed into Agro bacterium tume- 
faciens C58C1:pGV2260and (Deblaere et al„ Nucl. Acids. Res, 13(1984), 4777-4788) 
and incubated with Streptomycin/Spectinomyan selection. The material used for the 

20 transformation of tobacco plants (Nicotiana tabacurn cv. Samsun NN) with one of the 
binary clones as depicted in table 3 was an overnight culture of a positively trans- 
formed agrobacterial colony diluted with YEB medium to OD600 - 0.8-1.6. Leaf discs 
of sterile plants (approx. 1 cm2 each) were incubated for 5-10 minutes with a 1:50 
agrobacterial dilution in a Petri dish. This was followed by incubation in the dartc for 2 

25 days at 25°C on Murashige-Skoog medium (Physiol. Plant. 15(1962), 473) supple- 
mented with 2% sucrose (2MS medium) and 0.8% Bacto agar. The cultivation was con- 
tinued after 2 days at a 164iour-light/84iour-darkness photoperiod and continued in a 
weekly rhythm on MS medium supplemented with 500mg/l Claforan (cefotaxime so- 
dium), 50mg/l kanamycin, 1mg/| benzylaminopurin (BAP). 0.2mg/l naphthylacetic acid 

30 and 1.6g/l glucose. Regenerated shoots were transferred onto an MS medium supple- 
mented with kanamycin and Claforan. Transgenic plants of lines as depicted in table 3 
were generated in this manner. 

Table 3: Plant lines generated 
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Partial cDNA with pheno- 
type In transgenic tobacco 


Plant line 


Corresponding 
full length cDNA 


Function 


SEQ ID NO: 20 


E_J)000013511 


SEQ ID NO: 3 


ClpP2 protease 


SEQ ID NO:19 


E_0000008893 


SEQ ID NO:11 


ClpP5=ClpR1 pro- 
tease 


SEQ ID NO:21 


E_0000012393 




ClpP6 protease 






SEQ ID NO: 17 


ClpP-like protease 



The integration of the clone cDNA into the genome of the transgenic lines was detected 
via PCR with the oligonucleotides G1 and G2 (see Table 2) and genomic DNA pre- 
pared from the transgenic lines in question. To this end, TAKARA Taq DNA poly- 
5 merase was preferably employed for this purpose, following the manufacturer's instruc- 
tions (MoBiTec, Gottingen). The cDNA clone of the binary cDNA library, which clone 
had been used for the transformation, acted as template for a PCR reaction as the 
positive control. PCR products with an identical size or, if appropriate, identical cleav- 
age patterns which were obtained after cleavage with a variety of restriction enzymes 
10 acted as proof that the corresponding cDNA had been integrated. In this manner, the 
insert of clones were detected in the respective transgenic plant lines (as depicted in 
table 3) with the belowmentioned phenotypes. 

After the shoots had been transferred into soil, the plants were observed for 2-20 
15 weeks In the greenhouse for the manifestation of phenotypes. It emerged that trans- 
genic plants of lines E_0000012393, E_0000013511 and E_0000008893 were similar 
in phenotype. The plants showed severe chlorosis and concomitant growth retardation 
with respect to wild type plants after 2 weeks. 

20 Example 3: Sequence analysis of the clones 

SEQ ID NO:19 was fully sequenced and used for the detection of the corresponding full 
length clone SEQ ID NO:1 1 . SEQ ID NO:1 1 is identical to nt0Q2Q50O71r SEQ ID 
NO:19 in the overlapping region. An open reading frame of 867 nt (pos. 2-1 162) en- 

25 codes for 387 amino acids (SEQ ID NO: 4) with highest identity to-ClpR1 from Arabi- 
dopsrs thaliana-. Sequence homology suggests that the 5'-ends of SEQ ID NO: 11 and 
ClpR1 from Arabidopsis thaliana are very diverse and that nt006066004r is close to 
being toll size with respect to ClpR1. MS-Analysis of isolated CIpP Proteins from Arabi- 
dopsis indicate, that the mature ClpR1 is several kDal shorter as suggested by the 

30 cDNA Sequence (Peltier et al. 2001 , The Journal of Biological Chemistry 276, 
99.16318-16327). 

SEQ ID NO:20 was fully sequenced and used for the detection of the corresponding full 
length clone, SEQ id NO: 3, SEQ ID NO:3 is identical to SEQ ID NO:20 in the ovedap- 
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ping region. An open reading frame of 867 nt <pos. 1 1-877) encodes for 289 amino 
acids (SEQ ID NO:4) with highest identity to ClpP2 from Arabidopsis thaliana. 

SEQ ID NO:21 was fully sequenced . The partial cDNA Sequence of 602 nt contains an 
5 open reading frame of 1 86 nt (nt 8-1 93) encoding for 62 amico acids (SEQ ID NO:22). 
This partial polypeptide shows highest identity to ClpP6 from Arabidopsis thaliana 
(SEQ ID NO:9) 

A further ClpP-homolog cDNA of 906 nt (SEQ ID N0:17) was identified. An open read- 
1 0 ing frame of 71 1 nt (pos. 45-755) encodes for 237 amino acids (SEQ ID NO:1 8) with 
highest identity to a ClpP-like protein from Arabidopsis thaliana (GeneBank Acc. No 
AK1 18523). 

Thus, it was shown for the first time and in a surprising manner that the natural expres- 
1 5 sion of nuclear encoded Clp protease encoding genes is essential for plants and that 
reduced expression leads to damage as depicted by the phenotypes mentioned in Ex- 
ample 2 demonstrating the suitability of nuclear encoded dp-proteases as target for 
herbicides. 

20 Example 4: Expression in E.coli 

In order to generate active protein with nuclear encoded Clp-protease activity frag- 
ments of SEQ ID NO:1 1 , -SEQ ID NO:3 and of SEQ ID NO:9 were subcloned into the 
expression vector P QE60 (Quiagen, Hiiden, Germany). To this end the oligonucleotides 

25 displayed in tab. 4 where used to amplify via polymerase chain reaction cDNA frag- 
ments that contain Ncol and Bglll restriction sites. The PGR was carried out in 36 cy- 
cles following standard conditions (for example as described by Sambrook, J. et al. 
(1989) "Molecular cloning: A laboratory manual", Cold Spring Harbor Laboratory 
Press), the annealing temperatures being between 45 and 55°C and the polymerization 

30 time being in each case 60 seconds per 1 0OObp. Cutting the cDNA fragments with Ncol 
and Bglll restriction enzymes and ligation into pQE60 cut with the same enzymes de- 
livered expression plasmids that were transformed into E. coli. Expression was per- 
formed in E. coli TOP 10F strains (Invitrogen, Karlsruhe, Germany) following induction 
with IPTG. Standard protocols (Invitrogen) were followed. 

35 

Enzyme preperations were achieved by breaking cells in a French-Press in 100mM 
Tris/HCI, pH 7.4, 2.5 mM EDTA, 1% Triton X-100, 

The expression products were purified by affinity chromatography on Ni-agarose where 
40 appropriate. The manufacturer's instructions were followed (Qiagen). 

Table 4 
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Construct 


Primer (Nucleic acid sequence) 


NT_apR1 ,J 


5'-TATACCATGGATTTGCCATCTTTG-3' (SEQ ID NO:28) 
5-ATAGATCTCACCTGGAGCCAG-3' {SEQ JD NO:29) 


Nt_ClpP2 1) 


5-GAGCCCATGGCAAGAGGAG -3' (SEQ ID NO:30) 
5-ATAGATCTTTCTAGCTTGAACC-3' (SEQ ID NO:31) 


AT_CipP6 Z) 


5'-TCAGCCATGGCCCCTGGAGGAC-3*(SEQ ID NO:32) 
5'-TAAGATCTTCAGTATTCTGTTTCC-3' (SEQ ID NO:33) 



1) Template: Nicotfana tabacum cDNA library 

2) Template: Arabidopsis thaliana cDNA library 



5 Example 5: Activity assay 

Isolated ClpP activity can be measured as described (Woo et at. 1989 The Journal of 
Biological Chemistiy 264 T pp.2088-2091) by using ffuoregenic substrates such as N- 
Suc-Leu-Tyr-(7-amino-4-methylcournarine) (SLT-AMC). The proteolytic cleavage delib- 
1 0 erates 7-amlno4-methylcoumarin r which can be detected fluorimetrically (emission at 
460nm by exitation at 390 nm). 

Standard assays contain: 50mM Tris/HCI, pH 8.0, 25mM MgCI 2 , 1mM SLT-AMC and 1- 
100 pg ClpP Enzyme. 

15 The assay is suitable in for high throughput screening in 96well and 384 well format 



Sequence Listing 



20 


Sequence 


Function 


Organism 




SEQ ID NO:1 {nucleic acid sequence) 


ClpP1 


Arabidopsis thanliana 


25 


SEQ ID NO:2 (amino acid sequence) 


CIpP1 


Arabidopsis thanliana 


SEQ ID NQ:3 (nucleic add sequence) 


CIpP2 


Nicotiana tabacuum 




SEQ ID NO:4 (amino acid sequence) 


ClpP2 


Nicotiana tabacuum 


30 


SEQ ID NO:5 (nucleic add sequence) 


ClpP3 


Arabidopsis thanliana 




SEQ ID NO:6 (amino acid sequence) 


ClpP3 


Arabidopsis thanliana 




SEQ ID NO:7 (nucleic acid sequence) 


ClpP4 


Arabidopsis thanliana 



35 
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SEQ ID NO:8 (amino acid sequence) ClpP4 
SEQ ID NO:9 (nudeic acid sequence) CIpPS 
5 SEQ ID NO:10 (amino add sequence) ClpP6 
SEQ ID N0:1 1 (nucleic acid sequence) ClpR1 
SEQ ID NO: 12 (amfno add sequence) ClpR1 
SEQ ID NQ:13 (nudeic add sequence) ClpR3 
SEQ ID NO: 14 (amino acid sequence) ClpR3 
15 SEQ ID NO:15 (nucleic acid sequence) ClpR4 
SEQ ID NO: 1 6 (amino add sequence) ClpR4 
SEQ ID NO:17 (nudeic add sequence) CIpP like 
SEQ ID NO:18 (amino acid sequence) CIpP like 



20 



25 



SEQ ID NO: 19 (nucleic acid sequence) ClpR1 
(fragment) 

SEQ ID NO:20 (nucleic acid sequence) ClpP2 
(fragment) 



Arabidopsis thanliana 



Arabidopsis thanliana 



Arabidopsis thanliana 



Nicotians tabacuum 



Nicotiana tabacuum 



Arabidopsis thanliana 



Arabidopsis thanliana 
Arabidopsis thanliana 



Arabidopsis thanliana 



Arabidopsis thanliana 



Arabidopsis thanliana 



Nicotiana tabacuum 



Nicotiana tabacuum 



SEQ ID NO:2l (nucleic acid sequence) ClpP6 Nicotiana tabacuum 
30 (fragment) 



SEQ ID NO:22 (amino acid sequence) ClpP6 
(fragment) 



Nicotiana tabacuum 



35 SEQ ID NO:23-33: Primer (nucleic add sequences) 



ACQ TFRRWaQ T.^Q GfcM- 



BASF Aktiengesellschaft 



20030949 
51 



PF 55117 EP 



We claim: 
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1 . "TTie use of nuclear encoded Clp-protease in a method for identifying herbicides, 

2. The use as claimed in claim 1 , wherein the Clp-protease is 

a) selected from the group consisting of ClpP1 -protease, ClpP2-protease. 
ClpP3-protease, ClpP4-protease and ClpP6-protease; or 

b) selected from the group consisting of ClpR1 -protease. ClpR3-proteas ei 
ClpR4-protease; or 

c) ClpP-like-protease. 

3. A plant nucleic acid sequence encoding a ClpP2-protease comprising: 

a) a nucleic add sequence with the nucleic acid sequence shown in SEQ ID 
NO:3, or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino add sequence shown in SEQ ID 
NO:4 by backtranslating. or 

c) a functional equivalent of nudeic acid sequence shown in SEQ ID NO:3 
which has an identity with SEQ ID NO:3 of has at (east 66%. 

4. A plant nucleic add sequence encoding a ClpR1 -protease comprising: 

a) a nudeic acid sequence with the nudeic acid sequence shown in SEQ ID 
NO;11,or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 1 2 by backtranslating, or 

c) a fundional equivalent of nucleic acid sequence shown in SEQ ID NO:1 1 
which has an identity with SEQ ID NO:1 1 of has at least 69%. 

40 5. A plant nudeic acid sequence encoding a ClpP-like-protease comprising: 



25 
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a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:l7,or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 18 by backtranslating, or 

c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:17 
which has an identity with SEQ ID NO:17 of has at least 67%, 

6. A polypeptide with the activity of a nuclear encoded Clp-protease f encoded by a 
nucleic acid molecule as claimed in claim 3, 4 or 5. 



7. An expression cassette comprising 

a) genetic control sequences in operable linkage with a nucleic acid sequence 
as claimed in claim 3, 4 or 5; or 

b) additional functional elements, or 

c) a combination of a) and b). 

8. A vector comprising an expression cassette as claimed in claim 7. 

25 9. A transgenic organism comprising at least one nucleic acid sequence as claimed 
in claim 4, 5 or 6 encoding a polypeptide with the activity of a CIp-protease, an 
expression cassette as claimed in claim 7 or a vector as claimed in claim 8, se- 
lected from among bacteria, yeasts, fungi, animal cells or plant cells. 

30 10. A method for identifying substances with herbicidal activity, comprising the follow- 
ing steps: 

I bringing a nuclear encoded CIp-protease into contact with one or more test 
compounds under conditions which permit the test compound(s) to bind to 
35 the nucleic acid molecule encoding CIp-protease or to the nuclear encoded 

CIp-protease, and 

ii. detecting whether the test compound binds to the CIp-protease of i), or 

40 iii- detecting whether the test compound reduces or blocks the enzymatic or 

biological activity of the CIp-protease of i), or 
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iv. detecting whether the test compound reduces or blocks the transcription 
translation or expression of the Clp-protease of i). 

11. A method as claimed in claim 1 0, wherein the Clp-protease is 

o 

a) selected from the group consisting of CIpPLprotease. ClpP2-protease, 
ClpP3-protease, ClpP4-protease and ClpP6-protease; or 

in b> fr ° m the flr ° Up consistin 9 of ClpR1-protease, ClpR3-protease, 

ClpR4-protease; or 

c) ClpP-Jike-protease. 

12. A method as claimed in claim 10, wherein 



15 

a) the ClpPi-protease is encoded by a nucleic acid sequence which com- 
prises: 

0 a nucleic acid sequence with the nucleic acid sequence shown in 
20 SEQIDNO:1,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO:2 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:1 which has an identity with SEQ ID NO:1 of has at least 50%; 

b) the ClpP2-protease is encoded by a nucleic acid sequence which com- 
prises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in 
SEQIDNO:3,or 



25 



30 



35 if) 



40 



iii) 



a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO:4 by back translating, or 

a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:3 which has an identity with SEQ ID NO:3 of has at least 50%; 
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c) the ClpP3-protease is encoded by a nucleic acid sequence which com- 
prises: 

i) a nucJefc acid sequence with the nucleic acid sequence shown in 
5 SEQ ID NO:5, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO:6 by back translating, or 



10 



Hi) a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:5 which has an identity with SEQ ID NO:5 of has at least 50%; 



d) the ClpP4-protease is encoded by a nucleic acid sequence which com- 
15 prises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in 
SEQ ID NO:7, or 

20 ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 

netic code, can be deduced from the amino acid sequence shown In 
SEQ ID NO:8 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
25 NO:7 which has an identity with SEQ ID NO:7 of has at least 50%; 

e) the ClpP6-protease is encoded by a nucleic acid sequence which com- 
prises: 

30 i) a nucleic acid sequence with the nucleic acid sequence shown in 

SEQ ID NO:9, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 

35 SEQ ID NO: 1 0 by back translating, or 

Hi) a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:9 which has an identity with SEQ ID NO:9 of has at least 50%; 

40 f) the ClpR1 -protease is encoded by a nucleic acid sequence which com- 

prises: 
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i) a nucleic acid sequence with the nucleic acid sequence shown in 
SEQ ID NO:11,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO: 12 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:11 which has an identity with SEQ ID NO:11 of has at least 50%; 

g) the ClpR3-protease is encoded by a nucleic acid sequence which com- 
prises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in 
15 SEQlDNO:13,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO:14 by back translating, or 

20 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:13 which has an identity with SEQ ID NO:13 of has at least 50%; 

h) the ClpR4-protease is encoded by a nucleic acid sequence which com- 
25 prises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in 
SEQ ID NO: 15, or 

30 ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 

netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO:16 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
35 NO:1 5 which has an identity with SEQ ID NO:15 of has at least 50%; 

i) the CIpP like-protease is encoded by a nucleic acid sequence which com- 
prises: 

40 i) a nucleic add sequence with the nucleic acid sequence shown in 

SEQIDNO:17, or 
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fi) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO: 18 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:17 which has an identity with SEQ ID NO:17 of has at least 50%; 

13. A method as claimed in claim 10, 11 or 12, wherein a test compound is selected 
which reduces or blocks the enzymatic or biological activity of Clp-protease. 

14. A method as claimed in any of claims 10, 11, 12 or 13. wherein 

i. either Clp-protease is expressed in a transgenic organism or an organism 
which naturally contains Clp-protease is grown, 

ii. the Cfp-protease of step i) is brought into contact with a test compound in 
the cell digest of the transgenic or nontransgenic organism, in partially puri- 
fied form or in homogeneously purified form, and 

20 iii. selecting a test compound which reduces or blocks the enzymatic activity of 

the Clp-protease of step a). 



15 



25 



40 



15. A method as claimed in any of claims 10, 11, 12 or 13. which comprises the fol- 
lowing steps: 

L generating a transgenic organism comprising a nucleic acid sequence en- 
coding Clp-protease, wherein Clp-protease is expressed recombinant^; 



ii. applying a test substance to the transgenic organism of i) and to a non- 
30 transgenic organism of the same genotype, 

iii. determining the growth or the viability of the transgenic plant and the non~ 
transgenic plant after application of the test compound, and 

35 iv. selecting test substances which bring about a reduced growth of the 

nontransgenic plant in comparison with the growth of the transgenic plant. 



16. A method as claimed in claim 15, which is carried out in a plant organism, a cya- 
nobacterium or proteobacterium. 

17. A method for identifying substances with growth-regulatory activity, which com- 
prises the following steps: 



Ze:ST = U £003*ZT"Z0 £90 



BASF Aktiengesellschaft 



20030949 



PF 55117EP 



10 



57 

i. generating a transgenic plant comprising a nucleic acid sequence Clp- 
protease, wherein CIp-protease is expressed recombinant^; 

iL applying a test substance to the transgenic plant of i) and to a nontrans- 
genie plant of the same variety, 

iii. determining the growth or the viability of the transgenic plant and the non- 
transgenic plant after application of the test compound, and 

iv. selecting test substances which bring about a reduced growth of the non- 
transgenic plant in comparison with the growth of the transgenic plant. 

18. A method as claimed in any of claims 10 to 17, wherein the substances are iden- 
15 tified in high-throughput screening method. 

19. A support comprising one or more of the nucleic acid molecules as claimed in 
claim 3, 4, or 5 one or more expression cassettes as claimed in claim 7 t one or 
more vectors as claimed in claim 8, one or more organisms as claimed in claim 9 

20 or one or more (poly)peptides as claimed in claim 6. 

20. A method as claimed in any of claims 1 0 to 1 8, wherein the substances are iden- 
tified in High-Throughput Screening using a support as claimed in claim 19. 

25 21. A compound with herbicidal activity, identified by one of the methods as claimed 
in any of claims 10 to 16, 18 and 20. 



30 



22. A compound with growth-regulatoiy activity, identified by the method as claimed 
in any of claims 17, 1 8 or 20. 

23. A method for the preparation of an agrochernical composition, which comprises 



a) identifying a compound with herbicidal activity by one of the methods as 
claimed in any of claims 10 to 16, 18 and 20 or a compound with growth- 

35 regulatory activity as claimed in any of claims 1 7, 1 8 or 20, and 

b) formulating this compound together with suitable auxiliaries to give crop 
protection products with herbicidal or growth-regulatory activity. 

40 24. A method for controlling undesired vegetation and/or for regulating the growth of 
plants, which comprises allowing at least one CIp-protease inhibitor as claimed in 
claim 21 or 22 or an agrochernical composition comprising a CIp-protease inhibi- 
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tor as claimed in claim 21 or 22 to act on plants, their environment and/or on 
seeds. 

25. The use of a Clp-protease inhibitor as claimed in claim 21 or 22 or of an agro- 
5 chemical composition comprising a Clp-protease inhibitor as claimed in claim 21 

or 22 in a method as claimed in claim 24. 
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Glp-protease as target for herbicides 

The present invention relates to CIp-protease, which, when absent brings about re- 
duced growth and chlorotic leaves as target for herbicides. For this purpose, novel nu- 
5 oleic acid sequences encompassing SEQ ID NO:3, SEQ ID NO:11 and SEQ ID NO: 17 
and functional equivalents of SEQ ID NO:3, SEQ ID NO:11 and SEQ ID NO: 17 are 
provided. Moreover, the present invention relates to the use of CIp-protease in a 
method for identifying compounds with herbicidal or growth-regulatory activity, and to 
the use of the compounds identified by this method as herbicides or growth regulators. 
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SEQUENCE ^ LISTING 

<110> BASF Aktiengeeellschaf t 

<120> dp-protease as target for herbiaides 

<X30> 20030949 

<X60> 33 

<170> Patentln version 3.1 

<210> 1 

<211> 591 

<212> PNA 

<213> Arabidopsie thaliana 
<220> 

<221> CDS 

<222> <D - • (591) 

<223> 



25 



30 



35 



40 



45 



50 



55 



60 



65 



<400> 1 

atg cct att ggc gtt oca aaa gta cct ttt cga agt act gga gaa gga 
Met Pro lie Gly val Pro Lye val Pro Phe Arg Ser Pro Gly Glu Gly 
15 10 15 

gat aca tct tgg gbt gac afca fcac aac cga ctb tat cga gaa aga tea 
Asp Thr Ser Trp Val Asp lie Tyr Asn Arg Leu Tyr Arg Glu Arg Leu 
20 25 30 

ttt ttt tta ggc caa gag get gat acc gaa ate teg aat caa etc att 
Phe Pile Leu Gly Gin Glu Val Asp Thr Glu lie Ser Aan Gin Leu lie 
35 40 45 

agt cct atg afca tat etc agt ata gaa aag gat acc aaa gat cfct tat 
Ser Leu Met lie Tyr Leu ser He Glu Lye Asp Thr Lys Aop Leu Tyr 
50 55 60 

tfcg ttt afca aac tct cct ggt gga tgg gta ata tct gga atg get att 
Leu Phe He Asn Ser Pro Gly Gly Trp Val He Ser Gly Met Ala He 
65 70 75 90 

tat gat act atg caa ttt gtg cga ccc gat gta cag aca ata tgc atg 
Tyr Asp Thr Met Gin Phe Val Arg Pro Asp Val Gin Thr He Cys Met 
85 90 95 

gga ttg gec get tea ata gca tec ttt ate eta gtc gga gga gca att 
Gly Leu Ala Ala Ser He Ala Ser Phe lie Leu Val Gly Gly Ala He 
100 105 HO 

acc aaa cgt ata gca tfcc cct cac get agg gta atg ate cat caa ccc 
"Thr Lys Arg He Ala Phe Pro His Ala Arg Val Met He His Gin Pro 
115 120 125 

get agt teg ttt tat gag gca caa acg gga gaa tnt ate tfcg gaa gcg 
Ala Ser Ser Phe tyr Glu Ala Gin Thr Gly Glu Phe He Leu Glu Ala 

130 135 140 

gaa gaa tta ctt aaa efct cgc gaa acc ate aca agg gtt tat gta eaa 
Glu Glu Leu Leu Lys Leu Arg Glu Thr He Thr Arg Val Tyr Val Gin 
145 150 155 160 

aga acg ggc aaa cct ata tgg gtt ata tec gaa gac atg gaa egg gat 
Arg Thr Gly Lys Pro He Trp Val He Ser Glu Asp Met Glu Arg Asp 



46 



96 



144 



192 



240 



286 



336 



384 



432 



480 



528 
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165 170 175 

gtt ttt atg tea gca aca gaa gec caa get cat gga ate gtt gat ctt 576 
val Phe Met Ser Ala Thr Glu Ala Gin Ala Hig Gly lie Val Asp Leu 
S 180 185 ' 190 

gta gcg gtt Caa baa 591 
Val Ala Val Gin 
135 

10 

<210> 2 
<211> 3,26 
<2l2> PRT 
15 <213> Arabidopaia thaliana 

<4;Q0> 2 

Met Pro lie Gly Val Pro Lys Val Pro Phe Arg Ser Pro Gly Glu Gly 
20 l 5 10 15 



25 



30 



45 



50 



Asp Thr Ser Trp Val Asp lie Tyr Asn Arg Leu Tyr Arg Glu Arg Leu 
20 25 30 



Phe Phe Leu Gly Gin Glu Val Asp Thr Glu lie Ser Aen Gin Leu lie 
35 40 45 



Ser Leu Met He Tyr Leu Ser lie Glu Lys Asp Thr Lya Asp Leu Tyr 
50 55 SO 



35 Leu Phe lie Aen Ser Pro Gly Gly Trp Val lie Ser Gly Met Ala He 
SB 70 75 80 

Tyr Asp Thr Met Gin Phe Val Arg Pro Aap val Gin Thr He Cys Met 

40 95 90 95 



Gly Leu Ala Ala Ser He Ala Ser Phe He Leu Val Gly Gly Ala He 
100 105 110 



Thr Lys Arg He Ala Phe Pro His Ala Arg Val Met He His Gin Pro 
115 120 12S 



Ala Ser Ser Phe Tyr Glu Ala Gin Thr Gly Glu Phe He Leu Glu Ala 
130 135 140 



55 Glu Glu Leu Leu Lys Leu Arg Glu Thr He Thr Arg Val Tyr Val Gin 
145 150 155 160 

Arg Thr Gly Lye Pro He Trp Val He Ser Glu Agp Met Glu Arg Asp 
60 165 170 175 

Val Phe Met Ser Ala Thr Glu Ala Gin Ala Hie Gly He Val Asp Leu 
160 195 190 
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Val Ala val Gin 
195 



<210> 3 

<211> 1024 

<212> DNA 

10 <213> Wicotiana tafcacum 

<220> 

<221> CDS 

<222> (11) ..(877) 
15 <:223> 



20 



35 



40 



50 



55 



60 



<4Q0> 3 

gcggocgcta atg gcg gtc act ttt ceg ace ace tct tec teg tat eta 49 

Met Ala Val Thr Phe Pro Thr Thr Ser Ser Ser Tyr Leu 
* 5 10 



cac teg aga act aaa gbc ccc cag cot tot tba age tge gec age aaa 97 
His Ser Arg Thr Lys val Pro Gin pro Ser Leu Ser Cys Ala Ser Lys 
25 15 20 25 

gtt ttt gtc gga tta aga age caa tct cct aat tct tat ggg act gca 145 
Val Phe val Gly Leu Arg Ser Gin Ser Pro Asn Ser Tyr Sly He Ala 

30 30 35 

gcg tct aat gta aat gtt gaa tct cac aat aga gtg tac aga agt att 193 
Ala Ser Asn Val Aen Val Glu Phe His Asn Arg Val Tyr Arg Ser lie 
50 55 ~ €0 

gaa tec gga act aga gac agt aaa cca aca cgt gta cga gtt tec atg 241 
Glu Ser Gly Thr Arg Asp Ser Lys Pro Thr Arg val Arg Val Ser Met 
65 70 " 75 

atg ccc att ggg aca cca aga gta ccc tac aga aat cca act gag gga 289 
Met Pro He Gly Thr Pro Arg Val Pro Tyr Arg Asn Pro Thr Glu Gly 

eo es 90 

aca tgg cag tgg gtt gat ttg tgg aat get cct tac cgt gaa cgt gtt 337 
Thr Trp Gin Trp Val Asp Leu Trp Asn Ala Leu Tyr Arg Glu Arg Val 
45 95 100 105 



att ttc ate gga caa cac ata gat gaa gaa ttt age aac cag ata ttg 3B5 
He Phe He Gly Gin His He Asp Glu Glu Phe Ser Asn Gin He Leu 
lit) 115 120 125 

gca aca atg ctg tat ctt gac agt att gat gat tec aag aag etc tac 433 
Ala Thr Met Leu Tyr Leu Asp Ser He Asp Asp Ser Lys Lys Leu Tyr 
"0 135 140 

ctg tat ate aat ggc cct ggt ggt gat eta act cca age atg gec ate 481 
Leu Tyr He Asn Gly Pro Gly <3ly Asp Leu Thr Pro Ser Met Ala He 
145 150 155 

tac gac aca atg caa agt ctt aaa agt get gtt ggc ace cat tgt gtg 529 
Tyr Asp Thr Met Gin Ser Leu Lys Ser Ala Val Gly Thr His Cys Val 
160 165 170 

ggc tat gec tac aat Ctt gee ggt ttt ctt ctt get get gga gaa aag 577 
Gly Tyr Ala Tyr Asn Jjeu Ala Gly Phe Leu Leu Ala Ala Gly Glu *ys 



Ll 1 frT eOOZ'ZTZO 690 



BASF Aktiengesellschaft 



20030949 



FF 55117 EP 



175 180 



185 



10 



ggc aat cga ttt gca atg cat ctt tea agg att gca eta caa tot cca 625 
GLy Aen Arg Phe Ala Met Pro Leu Ser Arg lie Ala Leu Gin ser Pro 
190 1W 200 205 

?f b 2P ?? fc 9Cg cgc g9a cag gct » at * at att »t gaa gca gat 673 

Ala Gly Ala Ala Arg Gly Gin Ala Asp Asp He Arg Aen Glu Ala Asp 
210 215 220 

^ S?* f ga gat taC ctt ttc aa S * ag tt: ^ * ct 5ag aag 721 
Glu Leu Leu Arg He Arg Asp Tyr Leu Phe Lys Glu Leu Ala Glu Lya 

225 230 235 

15 5u a S? 9 ° Ct 9tt gaa a * g 9tc cac * a * tta ^ c 99 atg aag 769 
Thr Gly Gin Pro Val Glu Lya Val Ma Lys Asp Leu Ser A?g Met Lya 

240 545 250 

9H f tC g ? b 9aa 9Ct Ctt 9aa tat »*= Ctt afca 9 ac C&t 817 

20 Arg Leu Asn Ala Gin Glu Ala Leu Glu Tyr Gly Leu He Asp Arg He 

255 260 265 

gtt agg cct ccc cgt att aag gca gat get cca cga aag gat acc aca 865 
Val Arg Pro Pro Arg He Lys Ala Asp Ala Pro Arg Lys Asp Thr Thr 
25 270 275 280 285 



30 



35 



40 



45 



50 



Ala Gly Leu G^ Cagfcccat;ac aca tcgtafca atttatggct gatagtggtt 917 

gtacgacttg cagtgttatt ttgeaattte tcttgttcaa bataeatatt gaaccctttt 977 
gatctactta ttcaaaaaca tgaaatcgtg agcagactag eggcege X024 

<210> 4 

<211> 269 

<212> PRT 

<213> Nicotiana fcabacum 

<400> 4 

Met Ala Val Thr Phe Pro Thr Thr Ser Ser Ser Tyr Leu His Ser Arg 
1 5 10 15 

Thr Lys Val Pro Gin Pro Ser Leu Ser Cy© Ala Ser Lye Val Phe Val 
20 25 20 

Gly Leu Arg Ser Gin Ser Pro Asn Ser Tyr Gly He Ala Ala Ser Asn 
35 40 45 



55 Val Asn Val Glu Phe His Asn Arg Val Tyr Arg Ser He Glu Ser Gly 
50 55 60 

Thr Arg Asp Ser Lys Pro Thr Arg Val Arg Val ser Met Met Pro He 
60 65 70 75 90 

Gly Thr Pro Arg Val Pro Tyr Arg ASn Pro Thr Glu Gly Thr Trp Gin 
95 90 95 
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Trp Val Asp lieu Trp Aan Ala Leu Tyr Arg Glu Arg Val lie Phe tie 
100 105 110 

5 

Gly Gin His He Asp Glu Glu Phe Ser Asn Gin lie Leu Ala Thr Met 
11S 120 12$ 

10 

Leu Tyr Leu Asp Ser Xle Asp Asp Ser Lys Lye Leu Tyr Leu Tyr lie 
13 0 13 5 140 

15 Asn Gly Pro Gly Gly Asp Leu Thr Pro Ser Met Ala He Tyr Asp Thr 
145 150 155 160 

Men Gin Ser Leu Lys Ser Ala Val Gly Thr Hi a Cys Val Gly Tyr Ala 
20 1€5 170 175 

Tyr Asn Leu Ala Gly Phe Leu Leu Ala Ala Gly Glu Lys Gly Asn Arg 
180 1S5 190 

25 

Phe Ala Met Pro Leu Ser Arg Xle Ala Leu Gin Ser Pro Ala Gly Ala 
195 200 205 

30 

Ala Arg Gly Gin Ala Asp Asp Xle Arg Aan Glu Ala Asp Glu Leu Leu 
210 515 220 

35 Arg Zle Arg Asp Tyr Leu Phe Lys Glu Leu Ala Glu Lys Thr Gly Gin 
225 230 235 240 

Pro Val Glu Lya Val His Lys Asp Leu Ser Arg Met Lys Arg Leu Asn 
40 245 250 255 

Ala Gin Glu Ala Leu Glu Tyr Gly Leu lie Asp Arg He Val Arg Pro 
260 265 270 

45 

Pro Arg lie Lys Ala Asp Ala Pro Arg Lys Asp Thr Thr Ala Gly Leu 
275 280 295 

50 

Gly 



55 <210> 5 

<211j» 1124 

<212> BttA 

<213> Arabidopsis thaliana 

60 <220> 

<221> CDS 

<222> (2) . . (B31) 

<222> 
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<4Q0> 5 

a atg gag atg agt ttg cgt etc get tea tct tea acc tea aac cca att 49 
Met Glu Met Ser Leu Arg Leu Ala Ser Ser Sex Thr Ser Asn Pro He 
15 10 15 

tgt eta eta aac cct gga aaa aac act aat ttc cca ate cga aac cat 97 
Cye Leu Leu Asn Pro Gly Lys Asn Leu Asn Phe Pro lie Arg Asn His 
20 25 30 

aga ate cct aaa act teg aaa ccc ttt tgc gtt agg tct tea atg age 145 
Arg He Pro Lye Thr Ser Lys Pro Phe Cys Val Arg Ser Ser Met Ser 
35 40 45 



15 ttg tct aaa cca ccc aga caa acc tta tct agt aac tgg gat gta tct 193 
Leu Ser Lys Pro Pro Arg Gin Thr Leu Ser Ser Asn Trp Asp Val Ser 
SO 55 60 

age ttc tec att gat tec gtt get caa tct cct tea aga etc cca agt 241 
20 Ser Phe Ser He Asp Ser Val Ala Gin Ser Pro Ser Arg Leu Pro Ser 
65 70 75 80 

ttc gaa gaa etc gat acc acc aac atg ttg etc cgt caa aga ate gta 2S9 
Phe Glu Glu Leu Asp Thr Thr Asn Met Leu lieu Arg Gin Arg lie Val 
25 65 90 95 

ttt ttg ggt tct cag gtt gat gat atg acg gcg gat ttg gtt ata agt 337 
Phe Leu Gly Ser Gin Val Asp Asp Met Thr Ala Asp Leu Val lie Ser 
100 105 110 



cag eta ttg tta eta gat get gag gac tea gaa aga gac att acg ctt 385 
Gin Leu Leu Leu Leu Asp Ala Glu Asp Ser Glu Arg Asp He Thr Leu 
115 120 125 



35 ttt ate sat tea ccc ggt gga tct att act get ggg atg gga ata tat 433 
Phe He Asn Ser Pro Gly Gly Ser He Thr Ala Gly Met Gly He Tyr 
130 135 140 

gat gca atg aaa caa tgt aag gcg gat gta cct act gtt tgc tea ggg 401 
40 Asp Ala Met Lye Gin Cys Lye Ala Asp Val Ser Thr Val Cys Leu Gly 
145 150 155 160 

tta get gca tct atg ggt gcg ttt ctt ctt get tct ggt tea aaa ggg 529 
Leu Ala Ala Ser Met Gly Ala Phe Leu Leu Ala Ser Gly Ser Lyg Gly 
45 165 170 175 

aaa egg tat tgt atg cct aac tct aaa gtt atg ate cat cag cca ctt 57 7 

Lys Arg Tyr Cys Met Pro Asn Ser Lys Val Met He His Gin Pro Leu 
190 165 190 



ggt act get gga ggc aaa gca acg gaa atg ago ata cgt ata aga gaa 625 
Gly Thr Ala Gly Gly Lys Ala Thr Glu Met ser He Arg He Arg Glu 
195 200 205 



55 atg atg tac cac aag att aaa ctt aac aaa ate ttc tct aga ate act $73 

Met Met Tyr His Lys He Lys Leu Asn Lys He Phe Ser Arg He Thr 

210 215 220 

ggg aag cct gaa tea gag ate gaa agt gac aca gac cgt gat aac ttc 721 

60 Gly Lys Pro Glu Ser Glu He Glu Ser Asp Thr Asp Arg Asp Asn Phe 

225 230 235 240 

ttg aat cca tgg gag gcg aaa gaa tat ggt ttg ate gac get gta ate 769 

Leu Asn Pro Trp Glu Ala Lys Glu Tyr Gly Leu He Asp Ala Val He 
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245 250 255 

gat gat ggg aaa ccg gga eta ate get cca att gga gat ggt act cct ai7 
Asp Asp Gly Lye Pro Gly Leu He Ala Pro lie Gly Asp Gly Thr Pro 
260 265 * 270 

cct cct aaa ace aaa gtc tgg gat ctt tgg aaa gtc gaa gga acc aag 865 
Pro Pro Lys Thr Lye Val Trp Asp Leu Trp Lys Val Glu Gly Thr Lys 
275 260 285 

aaa gac aac act aac ttg cca tct gag ego tec atg aca cag aat ggt 913 
Lys Asp Asn Thr Asn Leu Pro Ser Glu Arg Ser Met Thr Gin Agn Gly 
290 300 



15 tac gec gec att gaa tag aactgttgtt geagegttta cgccttctat 961 
Tyr Ala Ala He Glu 
305 



atgttattct ggtggtacct gtaaccatac aacgttgcat ttcctgtgtt tgtaccattt 1021 
ctctgataga ttttggaata atttgaaggc aaagatagat tattgtgtag agagecacaa 1081 
atttaatgat aaattgatca tcagcaetgg aaagctaaaa aaa 1124 

<210> 6 
<211> 309 
<2l2> PRT 

«213> Arabidopsis thaliana 
<400> 6 

Met Glu Met: Ser Leu Arg Leu Ala Ser Ser Ser Thr Ser Asn Pro He 
15 10 IS 

Cye Leu Leu Asn Pro Gly Lye Asn Leu Asn Phe Pro He Arg Asn Hie 
20 25 30 

Arg He Pro Lye* Thr Ser Lys Pro Phe Cye Val Arg Ser Ser Met Ser 
35 40 45 



45 Leu Ser Lys Pro Pro Arg Gin Thr Leu Ser Ser Asn Trp Asp Val Ser 
50 55 60 

Ser Phe Ser He Asp Ser Val Ala Gin Ser Pro Ser Arg Leu Pro Ser 

50 65 70 75 60 



Phe Glu Glu Leu Asp Thr Thr Asn Met Leu Leu Arg Gin Arg He Val 
85 90 95 



Phe Leu Gly Ser Gin Val Asp Abp Met Thr Ala Asp Leu Val He Ser 
100 105 110 



Gin Leu Leu Leu Leu Asp Ala Glu Asp Ser Glu Arg Asp He Thr Leu 
115 "* 120 125 
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Phe lie Asn Ser Pro Gly Gly ser lie Thr Ala Gly Met Gly He Tyr 
130 135 14 0 

5 Asp Ala Met: Lys Gin Cye Lys Ala Asp Val fier Thr val Cys Leu Gly 
145 3.50 155 * l6 l 

Leu Ala Ala Ser Met Gly Ala Phe Leu Leu Ala ser Gly Ser Lys Gly 
10 165 i 70 ^ 5 * 

Lys Arg Tyr Cys Met Pro Asn Ser Lys Val Met lie His Ola Pro Leu 
15 180 1*5 1*0 



20 



Gly Thr Ala Gly Gly Lys Ala Thr Glu Met Ser He Arg He Arg Glu 
195 200 205 

Met Met Tyr His Lys He Lyg Leu Aen Lys He Phe Ser Arg He Thr 
210 215 220 



25 Gly Lys Pro Glu Ser Glu He Glu Ser Asp Thr Asp Arg Asp Asn Phe 
225 230 235 " ~ 240 

Leu Asn Pro Trp Glu Ala Lys Glu Tyr Gly Leu He Asp Ala Val He 
^ U 545 250 



255 



Asp Asp Gly Lys Pro Gly Leu He Ala Pro He Gly Asp Gly Thr Pro 

35 



260 2S5 270 



40 



Pro Pro Lys Thr Lys val Trp Asp Leu Trp Lys Val Glu Gly Thr Lya 
275 290 285 

Lys Asp Asn Thr Asn Leu Pro Ser Glu Arg Ser Met Thr Gin Asn Gly 
290 295 300 



45 Tyr Ala Ala He Glu 
305 



<210> 7 

50 <211> 1163 

<2X2> DNA 

<213> ArabidopBis Chaliana 



<220> 
55 <221> CDS 

<222> (3).. (902) 
<223> 



60 <400> 7 



Ct nS C n£ G S5 C ^ 0 gCC ^ 9ga 3CC Cta tCSt Ctc tCft tat 47 

Phe Phe Phe Phe Ala Ser Ala Met Gly Thr Leu Ser Leu Ser Ser 
1 5 10 is 



3i:8T:n EOOZ'ZT'ZO bLO 
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tit 11? £ Ct T CtC 9tt tCa tca a9a Gtc aaC "t too tec 9 5 

Ser Leu Lye Pro Ser Leu Val Ser Ser Arg Leu Asn Ser Ser Ser Ser 
20 25 30 

5 £a Ser S l 0t «° S " fc ° Ct 3aa CCa aac aat cfcc * a ° aaa ccc 143 

Ala Ser Ser Ser Ser Phe Pro Lye Pro Asn Aa n Leu Tyr Leu Lye Pro 

35 40 45 

10 f C ° i?* toa Cca cct atc aga aca a <"= tea cca tea ace ttg 191 

10 Thr Lys Leu He Ser Pro Pro Leu Arg Thr Thr Ser Pro Ser Pro Leu 



25 



45 



50 



55 60 



* 9 ^° 9 ?° aat SCt Cca atc 9a9 at 9 tc 9 cag aca oag gaa cca get 230 
Arg Phe Ala Asn Ala Ser He Glu Met Ser Gln Thr Gin Ilu Ser 111 

ill a™ r? a 2? 8 ^ Ct 9a ° 9tC afc £ f ^ otc ct « ^99 cga 297 

lie Arg Sly Ala Glu Ser Asp val Met Gly Leu Leu Leu Arg Ilu Arg 

20 85 90 ' 95 



a if SS? CtC 2? b a9t asft atc 9ac gat tto gtc got gat get atc 

He Val Phe Leu Gly Ser Ser He Asp Asp Phe Val La lap lie He 

100 105 no 

m~? Z? 3 bCg Ctt Ctc tta gafc 9Ct aaa 9at cca aag aa a gat atc 
Met Ser Gin Leu Leu Leu Leu Asp Ala Lys Asp Pro Lys Ly s lap He 

115 120 125 



cot ctt gga ggt gca age ggt caa got ata gat gtu gag att caa act 
Pro Leu G ly Qiy Ai a Ser OXy Gln L a n e Lp Val Ilu He Gin lie 
195 200 205 



aH 9 1 M t" 5 tCt CCa abt aaa 9ca 9tt 9 a 9 tab 99t tta att gat gga 
Arg Tyr Met Ser Pro lie Glu Ala Val Glu Tyr Gly Leu He Asp Ily 

60 250 255 



335 



383 



™ f t ? a ? C aat ccfc cot " fc " b tct c to agt gca acc atg got 431 

30 Lys Leu Phe He Asn Ser Pro Gly Gly Ser Leu Ser La Thr Me? Ka 
130 135 140 

?? a Sf C ? at S C ? 9tC Cag ocfc 9t 9 a9a 9Ct 9 at 9tt teg acg att get 479 
35 lis" Va Gln Jlo Val *** Ala Aap ^ al Th * Ile 

ctt ggc att got gca tea aca get tog att att ctt ggt gcg gga act 527 
Leu Gly He Ala Ala Ser Thr Ala Ser He He Leu Sly lie Gly Sr 
40 165 170 175 

Lvs fflE ^ IV ff b CCQ aa ° a ° 9 ag9 aCa at9 att Ga * «a 575 

Ly 3 Gly Lys Arg Phe Ala Met Pro Asn Thr Arg Ile Met He His Gln 

lfi0 195 190 



€23 



*5fl ? 9 2 ? 9 Cat aac aa9 aac aat 9tc acc age att atc gog gga 671 
50 Lys Glu Val Met His Asn Lys Aan Aan Val Thr Ser He lie III ITv 
210 21S 220 

S? b a ^ fc a9t ° 9a tC9 ttC 9ag cag 9tt cb 9 a »a gat att gat agg gac 719 
Cys Thr Ser Arg Ser Phe Glu Gln Val Leu Lys Lp He Lp Arf Sp 
Mv 230 235 



767 



SSf ? ? aC agC att att CCt Gfcb cot OCt gat aga Qi 5 

Val lie Asp Gly Aap Ser He lie Pro Leu Glu Pro Val Pro Lp Arg 

2S0 265 5 70 



8i:*T C00Z - ZT # ?0 9L0 
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45 



50 



10 

gtg aaa cog aga gta aaa tac gag gag act age aag gat acg atg aaa Q63 

Val Lys Pro Arg Val Agn Tyr Glu Glu He Ser Lys Asp Pro Met Lys 

275 280 285 

ttc ttg act ccc gag ata cct gat gat gag ate tac taa agccaagctc 912 
Phe Leu Thr Pro Glu lie Pro Asp Asp Glu lie Tyr 
290 295 



10 


gtctagaagc 


agggatcttc 


aaatgtgact 


aagactagca gtttcgagga 


aaagctcaat 


972 


ttcttctgcg 


gttactggta ttggctttgc 


gaaaccgaag ctggtagcac 


ttggcttttg 


1032 




tatctcatat 


ttcagttgtt 


cagaaaataa 


ttgttcttta aatcactctg 


ttttgaggaa 


1092 


15 


aatgacttaa 


agaagctgta 


gttatctcgt 


ttatgacaat cccttcaagt gtttaatgga 


1152 




ttcaagaagt 


ateagecagt 


atttttgtgg 


t 




1163 



20 <2io> s 

<211> 259 

<212> PRT 

<;2l3> Arabidopais thaliana 

25 <400> a 



Phe Phe Phe Phe Ala Ser Ala Met Gly Thr Leu Ser Leu Ser Ser Ser 
15 10 15 



Leu Lys Pro Ser Leu Val Ser Ser Arg Leu Asn Ser Ser Ser Ser Ala 
20 25 30 



35 Ser Ser Ser Ser Phe Pro Lye Pro Aon Aen Leu Tyr Leu Lys Pro Thr 
35 40 45 

Lys £.eu He Ser Pro Pro Leu Arg Thr Thr Ser Pro Ser Pro Leu Arg 
40 50 55 60 



Phe Ala Aan Ala Ser He Glu Met Ser Gin Thr Gin Glu Ser Ala lie 
65 70 75 80 



Arg Gly Ala Glu Ser Asp val Met Gly Leu Leu Leu Arg Glu Arg He 
65 90 95 



Val Phe Leu Gly Ser Ser He Asp Asp Phe Val Ala Asp Ala He Mec 
100 105 110 



55 Ser Gin Leu Leu Leu Leu Aep Ala Lys Asp Pro Lya Lys Asp He Lys 
115 120 125 

I*eu Phe He Asn Ser Pro Gly Gly Ser Leu Ser Ala Thr Met Ala He 
60 130 135 140 

Tyr Asp Val Val Gin Leu Val Arg Ala Abp Val Ser Thr He Ala Leu 
145 150 155 160 



BASF Aktlengesellschaft 20030949 pp 551 1 7 EP 

11 

Gly He Ala Ala Ser Thr Ala Ser lie n e Leu Qly Ala sly Thr 
5 165 170 175 



10 



Oly Lys Arg Phe Ala Met Pro Asn xhr Arg He Met He Hie Gin Pro 
180 135 190 

Leu Gly Gly Ala Ser Qly Gin Ala lie Asp Val Glu He Gin Ala Lys 
195 200 205 



15 JlJ A9tt ^ JJJ AaD VSl Thr Ser Ile Ala Gly Cy S 



220 



20 J£ Gln VSl LeU AS P Asp Arg Asp Arc, 

235 240 

Tyr Met Ser Pro lie Glu Ala Val Glu Tyr Gly Leu He Asp Gly Val 
25 24S 250 255 



30 



Tie Asp Gly Asp Ser He He Pro Leu Glu Pro Val Pro Asp Arg Val 
260 265 270 

Lys Pro Arg Val Abu Tyr Glu Glu He Ser Lys Asp Pro Met Lys phe 
275 280 285 



35 Leu thr Pro Glu 



230 295 



He Pro Asp Asp Glu He Tyr 



<210> 


9 


<2ll> 


1056 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(SI) - , (876) 


<223> 




<400> 


9 



55 



f?! 2?* CCt CCt CtC 93* Ctt ttC tct tot 10B 

Met Ala Gly Leu Ala He Ser Pro Pro Leu Gly Leu Ser Phe Ser Ser 
3 10 15 

Aro ££ ° Ct f aa CCC act tcc fctt cC * tct cac aat caa agg i 56 

Arg Thr Arg Asn Pro Lys Pro Thr Ser Phe Leu Ser His Asn Gin Ar| 

25 3 q 

Jsn S ?S gt ? tCt gCfc Cta a * fc ^ Sff* gat 

Asn Pro lie Arg Arg xie Val Ser Ala Leu Gin ser Pro Tyr Gly Asp 
35 40 45 



204 



6T:*T EOOZ'ZT-ZO 

J J *C T7DQbCIQ T^Q Cb4- 
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20 



25 



40 



45 



60 



12 

tct ctg aaa gcc gga ctt tct agt aat gtt bet gga tec cca ata aag 
Ser Leu Lys Ala Gly Leu Ser Ser Abii Val Ser Gly Ser Pro He Lys 
50 55 6 o 



etc tct tct gtg tta tec cgt aac cgc ata ate ttc acc ggg caa cca 
Leu Ser Ser Val Leu Phe Arg Asn Arg He Xle Phe He Gly Gin Pro 
15 100 105 110 



252 



att gac aac aag get cca aga ctt gga gtg ata gag gcg aaa aag gga 300 
He Asp Aan Lys Ala Pro Arg Phe Gly Val He Glu Ala Lys Lys Gly 
65 70 75 ' SO 

aac ccc cca gta atg cct tea gtg atg acc cct gga gga cct tta gac 348 
Asn Pro Pro Val Met Pro Ser Val Met Thr Pro Gly Gly Pro Leu Asp 
B5 90 " 95 



396 



att aac gca cag gtt get cag aga gtc ata tct cag ctt gta acc ctt 444 
He Asn Ala Gin Val Ala Gin Arg Val He Ser Gin Leu Val Thr Leu 
115 120 125 

gca tct att gat gat aaa tec gac ate ctg atg tac ttg aat tgt ccc 492 
Ala Ser He Asp Aap Lys Ser Asp He Leu Met Tyr Leu Asn Cys Pro 
130 135 140 

ggt ggc agt act tac tec gtc eta aca att tat gac tgt atg tct tgg 540 
Gly Gly Ser Thr Tyr Ser Val Leu Thr He Tyr Asp Cy© Met Ser Trp 
145 150 155 ~ 160 

ata aag cct aaa gtt gga aca gtg gcg ttt gga gta get gca age caa 5B8 
30 He Lys Pro Lys Val Gly Thr Val Ala Phe Gly Val Ala Ala Ser Gin 

165 170 175 

gga gca ttt ttt ctt get gga ggt gaa aaa gga atg cgt tat gca atg 636 
Gly Ala Phe She Leu Ala Gly Gly Glu Lys Gly Met Arg Tyr Ala Met 
35 l$o 185 190 



cca aat act cgt gtc atg ata cat caa cca caa act gga tgc gga gga 6S4 
Pro Asn Thr Arg Val Met He His Gin Pro Gin Thr Gly Cys Gly Gly 
195 200 205 

cat gta gag gac gtg agg aga cag gtc aat gaa gcc ate gaa gcc cga 732 
His Val Glu Asp Val Arg Arg Gin Val Asn Glu Ala He Glu Ala Arg 
210 215 220 

caa aaa ate gac agg atg tat gca get ttc act gga caa cct ctg gag 780 
Gin Lye He Aep Arg Met Tyr Ala Ala phe Thr Gly Gin Pro Leu Glu 
225 230 235 240 



aaa gcg cag caa tac act gaa aga gac cgt ttc tta tea gca cct gag 828 
OU Lys Val Gin Gin Tyr Thr Glu Arg ASp Arg Phe Leu Ser Ala ser Glu 

245 250 255 

gcg ttt gag ttc ggg etc att gat ggt eta ttg gaa aca gaa tac tga 876 
Ala Phe Glu Phe Gly Leu He Asp Gly Leu Leu Glu Thr Glu Tyr 
55 2G0 2$5 270 

agcagcatac aggacaatgc acaacaacag etcattgeaa tgttcaaagc ttccattttc 936 

atttgaatat gaacggttgt aactgatatt tgtgeataaa tcagtttggt tttcttggtt 996 

ttattgtcta ctaaacagaa tgagaaaact aaactgtcta ttt ttt tact gaaaaatctg 105 6 

«£310> 10 
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<211> 271 
<ZXZ> PRT 

<213> Arabidopsis thaliana 
5 <400> 10 



10 



30 



50 



Met Ala Gly Leu Ala He Ser Pro Pro Leu Gly Leu Ser Phe Ser Ser 
1 5 1° 15 

Arg Thr Arg Asn Pro Lys Pro Thr ser Phe Leu Ser His Asm Gin Arg 
20 25 so 



15 Aen Pro He Arg Arg He val Ser Ala Leu Gin Ser Pro Tyr Gly Asp 
33 40 45 

On LyS Ala Gly Leu Ser Sez " Aan Val fi « r <3ly Ser Pro He Lys 

50 55 60 

lie Asp Aan fcys Ala Pro Arg Phe Gly Val He Glu Ala Lys Lys Gly 

25 * S 70 75 90 y 



Aen Pro Pro Val Met Pro Ser val Met Thr Pro Gly Gly Pro Leu Asp 
35 90 95 

Leu Ser Ser Val Leu Phe Arg Aan Arg He He Phe He Gly Gin Pro 
100 105 no 



35 He Aen Ala Gin Val Ala Gin Arg Val He Ser Gin Leu Val Thr Leu 
115 120 a25 

Ala Ser He Asp A 3p Lys Sar Asp He Leu Met Tyr Leu Asn Cys Pro 
W 130 135 140 

Gly Gly ser Thr Tyr Ser val Leu Thr lie Tyr Asp Cys Met Ser Trp 
45 145 150 155 ISO 



He Lys Pro Lys Val Gly Thr val Ala Phe Gly Val Ala Ala Ser Gin 
165 170 175 

Gly Ala Phe Phe Leu Ala Gly Gly Glu LyS Gly Mec Arg T yr Ala Met 
180 is5 ~ 1$0 



55 Pro Asn Thr Arg Val Met He His Gin Pro Gin Thr Gly Oyg <ji y G ly 
195 200 205 

fin HiS ofi G1U Val 4x3 GXn val Aan Glu Ala Ile Glu Ala Arg 
w 210 215 220 

Gin Lys He Asp Arg Met Tyr Ala Ala Phe Thr Gly Gin Pro Leu Glu 
225 "0 235 240 



5T-*frT e00Z*3T-Z0 6L0 
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Lys val Gin Gin Tyr Thr Glu Arg Asp Arg Phe Leu Ser Ala Ser 01 u 
245 250 255 



Ala Phe Glu Phe Gly Leu He Asp Gly Leu Leu Glu Thr Glu Tyr 
260 265 270 



<210> 11 

«211> 1448 

<212> DNA 

<213> Nicotiana tabacum 
<220> 

^22X> CDS 

<222> (2) . . (1162) 

<223> 



<400> 11 

g egg ccg ctg get tat tct ttg ctt etc tct ccg ctt tct ago teg acg 49 

Arg Pro Leu Ala Ser Ser Leu Leu Leu Ser Pro Leu Ser Ser Ser Thr 
25 1 5 10 15 



gtt act gaa aat cga gag ccg ggt tct ggt aaa tea act ttc ata tec 97 
Val Thr Glu Asn Arg Glu Leu Gly Ser Gly Lye Ser Thr phe He Ser 
20 25 30 

agt ccc aat ttc tec ttt gca act tct gtt cac agt tgc agg cca aac 145 
Ser pro Asn Phe Ser Phe Ala Thr Ser Val His Ser Cys Arg Pro Asn 
35 40 45 



35 ggc gtt cga ggt tat tgt tac agg tct ccg gca get aag tct ttg gac 193 
Gly Val Arg Gly Tyr Cye Tyr Arg Ser Pro Val Ala Lys Ser Leu Asp 
50 55 60 

cat ata ccc caa aaa ttc aga ctg gaa aac etc aaa gat gga eta ctg 241 
40 His He Pro Gin Lye Phe Arg Leu Glu Asn Leu Lye Asp Gly Leu Leu 
65 70 75 80 

gac aac tat aaa agt gca cct cag tat ctt cac ggc ctt agt cct tea 289 
Asp Aen Tyr Lyg Ser Ala Pro Gin Tyr Leu Tyr Gly Leu Ser Pro Ser 

45 as 90 95 

cag atg gat atg ttc atg aaa gaa gat aaa cca gta egg cga cag tea 337 
Gin Met Asp Met Phe Met Thr Glu Asp Aen Pro Val Arg Arg Gin Ser 
100 105 110 



gaa agt gee aat gag gat agt ata tct tea gec aat aac tat ctg age 3B5 
Glu Ser Ala Thr Glu Agp Ser He Ser Ser Ala Asn Asn Tyr Leu Ser 
115 120 125 



55 aat ggt gga atg tgg agt atg tec ggc atg aac gat egg ggc ccc teg 433 
Aen Gly Gly wet Trp Ser Met Ser Gly Met Asn Asp Arg Gly Pro Ser 
130 135 140 

aaa tac agt atg agt gtc age atg taa cgt gga gga aaa aga gga tct 481 
60 Lys Tyr Ser Met Ser Val Ser Met Tyr Arg Gly Gly Thr Arg Gly Ser 
145 150 155 160 

gga aga cct cga act gcg act cct gat ttg eca tct ttg ctt ttg gac 52 9 

Gly Arg Pro Arg Thr Ala Pro Pro Asp Leu Pro Ser Leu Leu Leu Asp 



6T:*T E002'2T # Z0 080 , _ 
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165 



15 

170 



175 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



get cga att gtc tat ctg ggc atg cct att gta cca get gtt aca gag 577 
Ala Arg He Val Tyr Leu Gly Met Pro He Val Pro Ala val Thr Glu 
1B0 155 190 

ctt ctt gtt get cag ttt atg tgg ttg gat tat gac aat cca tea aag 625 
LOU Leu Val Ala Gin Phe Met Trp Leu Asp Tyr Asp Asn Pro Ser Lye 
195 200 205 

cct ata tac eta tat aca aac tea tea ggc aca cag aat gag aag atg 673 
Pro He Tyr Leu Tyr He Asn Ser Ser Gly Thr Gin Asn Glu Lys Met 
210 215 220 

gag act gtt ggg tot gaa aca gag gca tat gec ate get gac aca atg 721 
Glu Thr Val Gly Ser Glu Thr Glu Ala Tyr Ala He Ala Asp Thr Met 
335 230 235 240 

gca tac tgc aaa tea gat ate tat aca gtg aac tgt ggc atg gca tat 769 
Ala Tyr Cys Lyg Ser Asp He Tyr Thr Val Asn Cya Gly Met Ala Tyr 
245 250 255 

ggt caa gca gca atg ctt ctg tea ctg gga aag aag ggg ttc cgt got 817 
Gly Gin Ala Ala Met Leu Leu ser Leu Gly Lys Lye Gly phe Arg Ala 
260 265 " * 270 

atg cag cca aat tea tct aca aaa ttg tac eta cct aaa gtc age aaa 865 
Met Gin Pro Asn Ser Ser Thr Lya Leu Tyr Leu Pro Lys Val Ser Lys 
27$ 280 " 205 

tec age gga gca gtg ata gat atg tgg ate agg gec aaa gaa eta gat 913 
Ser Ser Gly Ala Val He Aep Met Trp He Arg Ala Lys Glu Leu Agp 
290 295 300 

gca aae act: gag tat tac ctt gaa eta tta gcg aaa gga gtt gga aaa 961 
Ala Asn Thr Glu Tyr Tyr Leu Glu Leu Leu Ala Lys Gly Val Gly Lys 
305 310 315 320 

cca aag gaa gaa ate gag aaa gat att caa age cct aaa tat ctg egg 1009 
Pro Lye Glu Glu He Glu Lys Asp He Gin Arg Pro LyB Tyr Leu Arg 
325 330 335 

gca caa gaa gec att gac tat ggc att gcg gac aag ata ate gat tea 1057 
Ala Gin Glu Ala He Asp Tyr Gly He Ala Asp Lys He He Asp Ser 
340 345 350 

aga gac aat gca ttfc gag aaa agg aac ttab ggt gag ata etc gee caa H05 
Arg Asp Asn Ala Phe Glu Lys Arg Asn Tyr Gly Glu He Leu Ala Gin 
355 360 3£S 

tct aga get atg agg aaa gec gga cca ggt get eag get get cca tct 1153 
Ser Arg Ala Met Arg Lys Ala Gly Pro Gly Ala Gin Ala Ala Pro Ser 
370 375 380 

ggc tec agg tgactggaag ageggtaatg gtcccaagct ttcaggaaca 1203 
Gly Ser Arg 

385 

actgttgttc ccttatagtt tcgaggaaca aagttgctgg ttacttggtc tgtgccggta 1262 

taatgtaact gggaeaaaga acatattgta gaaaccttgt ttgagctgtg aagtataggg 1322 

gttttacaac tattatgeae aggtctgeaa agagtaccca taatgtcaat tggttgtacc 1382 



02=frT COOZ'ZT'ZO 180 
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agtatcaaac aatcagatag tgccagtgta tggtataaat gaatatagat ctctctgagc 1442 
ggccgc 1449 

5 

<210> 12 
<2ll> 387 

<212> PfcT 

<2l3> Nicotiana tafcacum 

10 

<400> 12 

Arg Pro Leu Ala Ser ser Leu Leu Ser Pro Leu Ser Ser Ser Thr 

is 1 

Val Thr Glu Asn Arg Glu Leu Gly Ser Gly Lys Ser Thr Phe He Ser 
20 25 30 

20 

Ser Pro Asn Phe Ser Phe Ala Thr Ser Val h!b Ser Cya Arg Pro Asn 
35 40 45 

25 Gly Val Arg Gly Tyr Cys Tyr Arg Ser Pro Val Ala Lys Ser Leu Asp 
50 55 60 

His He Pro Gin Lys Phe Arg Leu Glu Agn Leu Lys Asp Gly Leu Leu 
30 65 70 75 ^ qq 

Asp Asn. Tyr LyS Ser Ala Pro Gin Tyr Leu Tyr Gly Leu Ser Pro Ser 
S5 90 ~ 95 

35 

Gin Meb Asp Met Phe Met Thr Glu Asp Asn Pro Val Arg Arg Gin Ser 
100 105 110 

40 

Glu Ser Ala Thr Glu Asp Ser He Ser Ser Ala Asn Asn Tyr Leu Ser 
115 130 125 

45 Asn Gly Gly Met Trp Ser Met Ser Gly Meb Aan Asp Arg Gly Pro Ser 
130 135 140 

Lys Tyr Ser Met ser Val Ser Met Tyr Arg Gly Gly Thr Arg Gly Ser 
50 145 150 155 160 

Gly Arg pro Arg Thr Ala Pro Pro Aap Leu Pro Ser Leu Leu Leu Asp 
1£5 170 175 

55 

Ala Arg He Val Tyr Leu Gly Met Pro He Val Pro Ala Val Thr Glu 
100 1Q5 190 

60 

Leu Leu Val Ala Gin Phe Met Trp Leu Asp Tyr Asp Asn Pro ser Lys 
195 200 205 



£Z:02=l7T eOOZ*2VZO 380 
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Pro Xle Tyr Leu Tyr He ABn Ser Ser Gly Thr Gin Asn Glu Lys Met 
210 21S 220 

5 Glu Thr Val Gly Ser Glu Thr Glu Ala Tyr Ala Xle Ala Asp Thr Met 
225 230 235 24Q 

Ala Tyr Cys Lys Ser Asp He Tyr Thr Val Aen Cyg Gly Met Ala Tyr 
,U 24S 250 25S 

Gly Gin Ala Ala Met Leu Leu Ser Leu Gly Lye Lys Gly Phe Arg Ala 
15 , 260 265 270 



20 



Met Gin Pro Aan ser Ser Thr Lys Leu Tyr Leu Pro Lys Val Ser Lya 
275 230 2es * 

° ly Ala Val Ile ASP Met Trp Ile Ar S Ala L ^ a Glu Le * Agp 
290 235 3Q0 



25 JJj ASft Thr 0Xu ^ Leu Glu Leu Leu Ala Lya Gly Val Gly Lyg 

Pro Lya Glu Glu Ile Glu Lya Agp He Gin Arg Pro Lya Tyr Leu AT* 
40 325 330 ~ 335 

Ala Gin Glu Ala Ile Asp Tyr Gly Ile Ala Asp Lya He Ile Aep Ser 
35 340 345 ago 

Arg Asp Aen Ala Phe Glu Lyg Arg Asn Tyr Gly Glu He Leu Ala Gin 
355 36o 3*5 

40 

Ser Arg Ala Met Arg Lya Ala Gly Pro Gly Ala Gin Ala Ala Pro Ser 
370 375 3S0 

45 Gly Ser Arg 
385 



<210> 13 

50 <2H> 1246 

<2X2> DNA 

<213> Arabidopais thaliana 

<220> 

55 <221> CDS 

<222> (38) . . <1030> 
<222> 



60 <40Q> 13 

attttcgcga gcttccgtgt ccaagagcfcc cfcogacc atg S cg tct tgt tta caa 55 

Met Ala Ser Cys Leu Gin 
1 5 
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gca tec atg aat tct ctg ctt cca cgc tct tct tct ttt tct cct cat 103 

Ala Ser Met Asn Ser Leu Leu Pro Arg Ser Ser Ser Phe Ser Pro Hie 

3.0 15 20 

5 cct cct eta tct teg aat tea tec ggg aga aga aac ttg aag act ttt 151 

Pro Pro Leu Ser Ser Asn Ser Ser Gly Arg Arg Aan Leu Lys Thr Phe 

25 30 35 

cgt tac gec ttt cgc gec aaa gec tct gec aaa ate cat atg cct ccg 199 

10 Arg Tyr Ala Phe Arg Ala Lys Ala Ser Ala Lys He Pro Met Pro Pro 

40 45 50 

ata aat cca aag gat cct ttc etc tec acg etc get tct att gec gcg 24? 

He Asn Pro Lys Asp Pro Phe Leu Ser Thr Leu Ala Ser He Ala Ala 

15 55 €0 65 70 



20 



40 



60 



aat tct ccg gaa aag ctt etc aat egg ccg gtt aac get gat gtg ccg 295 
Aen. Ser Pro Glu Lye Leu Leu Asn Arg Pro Val Aan Ala Asp Val Pro 
75 80 95 

cca tat ctt gac ate ttt gac tec cct cag etc atg tct tct cct gca 343 
Pro Tyr Leu Asp He Phe Asp Ser Pro Gin Leu Met Ser Ser Pro Ala 
90 95 100 



25 cag gtt gaa aga tea gtg get tat aac gag cac cga ccg aga act cct 391 
Gin Val Glu Arg Ser Val Ala Tyr Aan Glu His Arg Pro Arg Thr Pro 
105 110 115 

cca cca gac ttg cca tct atg cct ctt gac ggg aga ate gtt tac att 439 
30 Pro Pro Asp Leu Pro Ser Met Leu Leu Asp Gly Arg He Val Tyr He 
120 125 130 

gga atg cct ctt gtg ccg gca gtg act gag eta gtt gtc get gag eta 487 
Gly Met Pro Leu val Pro Ala val Thr Glu Leu Val Val Ala Glu Leu 
35 135 140 145 150 

atg tat ctt cag tgg ctg gat ccc aag gaa ccc att tac att tac ate 535 
Met Tyr Leu Gin Trp Leu Asp Pro Lys Glu Pro He Tyr He Tyr He 
155 160 165 



aac tec aca ggg ace act cgt gat gat gga gag acg gtt gga atg gaa 583 
Asn Ser Thr Gly Thr Thr Arg Asp Asp Gly Glu Thr Val Gly Met Glu 
17Q 175 180 



45 tea gaa ggg ttt gcg ate tat gac tct ttg atg caa ctt aaa aac gag $31 
Ser Glu Gly Phe Ala He Tyr Asp Ser Leu Met Gin Leu Lys Asn Glu 
185 190 195 

gta cat aea gta fcgt gtg gga gca gee ata ggt cag gec tgt eta tta 679 
50 Val His Thr Val Cye Val Gly Ala Ala He Gly Gin Ala Cya Leu Leu 
200 205 210 

ctt tct gcg gga acc aag ggt aaa egg ttt atg atg cca cat gec aaa 727 
Leu ser Ala Gly Thr Lys Gly Lys Arg Phe Met Met Pro His Ala Lye 
55 215 220 225 230 

gcg atg att cag caa cca cgt gta cct tct tct ggg ttg atg cct gec 775 
Ala Met He Gin Gin Pro Arg Val Pro Ser Ser Gly Leu Met Pro Ala 
235 240 245 



agt gat gtc ctg att egg gec aaa gag gta att aca aat agg gat ata 823 
Ser Aap Val Leu He Arg Ala Lys Glu Val He Thr A&n Arg Aep He 
250 255 260 
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cct gtg gaa cna eta tea aag cat; act ggg aat tec gtg gag act gta 871 
Leu Val Glu Leu Leu Ser Lye His Xhr Gly Aen Ser val Glu Thr val 
265 270 275 

S get aac gta atg aga agg cca tat tac atg gat gca cca aaa get aaa 919 
Ala Aan Val Met Arg Arg Pro Tyr Tyr Met Asp Ala Pro Lys Ala Lys 
2S0 285 290 

gaa ttt gga gtc att gac agg att ctt tgg age ggt caa gaa aag att 967 
10 Glu Phe Gly Val lie Asp Arg lie Leu Trp Arg Gly Gin Glu Lys lie 
295 300 305 310 

att gcg gac gtg gtt cct tea gag gaa ttc gac aag aat gca ggg att 1015 
He Ala Asp Val Val Pro Ser Glu Glu Phe Asp Lys Asn Ala Gly He 
15 315 320 M 32S 



20 



aaa age gta gta tga gtctagtctt aagttttctt ggectaaate atactgegtc 1070 
Lye ser val val 
330 



atggagaaga acaaatagac tgaccaaaat cacattggcc gcagactgcc ttgtttcaaa 1130 
tcacttggta aatgtgaaca tgegattagg agaafceatae ttaaaggatc ttgaaatatt 1190 
25 acgataaaat tgtaatgtgt ttgttcgtta gcaatagtaa atacaatcct caacte 1246 

<210> 14 

<211> 330 

30 <2l2* P£T 

<213> Arabidopaia thaliana 

<400> 14 

35 Met Ala Ser Cys Leu Gin Ala Ser Met Abu Ser Leu Leu Pro Arg Ser 
15 10 15 

Ser Ser Phe Ser Pro Hie Pro Pro Leu Ser Ser Asn Ser Ser Gly Arg 
40 20 25 3 0 

Arg Asn Leu Lys Thr Phe Arg Tyr Ala Phe Arg Ala Lys Ala Ser Ala 
35 40 45 

45 

Lys He Pro Met Pro Pro He Aen Pro Lys Asp Pro Phe Leu Ser Thr 
50 55 ~ 60 

50 

Leu Ala Ser He Ala Ala Asn Ser Pro Glu Lys Leu Leu Asn Arg Pro 
65 70 75 80 

55 Val Asn Ala Asp Val Pro Pro Tyr Leu Asp He Phe Asp Ser Pro Gin 

65 90 95 

Leu Met Ser Ser Pro Ala Gin Val Glu Arg Ser Val Ala Tyr Asn Glu 
60 100 105 110 

Hie Arg Pro Arg Thr Pro Pro Pro Asp Leu Pro Ser Met Leu Leu Asp 
115 120 125 
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20 



Gly Arg lie Val Tyr tie Gly Mec Pro. Leu Val Pro Ala val Thr Glu 
"0 135 140 

D 

Leu Val Val Ala Glu Leu Wet Tyr Leu Gin Trp Leu Asp Pro Lys Olu 
145 150 155 160 

10 

Pro lie Tyr He Tyr He ASn Ser Thr Gly Thr Thr Arg Asp Asp Gly 

165 170 " 175 

15 Glu Thr Val Gly Met Glu Ser Glu Gly Phe Ala Tie Tyr Asp Ser Leu 
180 185 190 

Met Gin Leu Lys Aim Glu Val Hie Thr Val Cys Val Gly Ala Ala lie 
20 195 200 205 

Gly Gin Ala Cys Leu Leu Leu Ser Ala Gly Thr Lys Gly Lys Arg Phe 
^ c 210 215 220 

25 

Met Met Pro His Ala Lys Ala Met lie Gin Gin Pro Arg Val Pro Ser 
225 230 235 240 

30 

Ser Gly Leu Met Pro Ala Ser Asp Val Leu He Arg Ala Lys Glu Val 
245 250 255 

35 He Thr Aen Arg Asp He Leu Val Glu Leu Leu Ser Lys His Thr Gly 
260 2S5 270 

Agn Ser Val Glu Thr Val Ala Asn Val Met Arg Arg Pro Tyr Tyr Met 
40 275 280 285 

Asp Ala Pro Lyg Ala Lys Glu Phe Gly val He Asp Arg He Leu Trp 
290 295 300 

45 

Arg Gly Gin Glu Lys He He Ala Asp Val val Pro Ser Glu Glu Phe 
305 310 315 320 

50 

Asp Lys Asn Ala Gly He Lys Ser Val Val 
325 330 

55 <210> 15 

<211> 1236 
<212> DMA 

<213> Arabidopsis thaliana 

60 <220> 

<221> CDS 

<222> (66).. (983) 

<223> 



£Z'XZ'^X COOS'ZT'ZO 980 
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<400> 15 

agatcgtfcafc cgtttcgggg tcacagggao cctcactctt tctcnctctc cgcaacaaag so 

aa3aa Mef SS v£? IS ?? a ?? 9 a 2 C 9 ° 9 aog a9 ° ttc aoa acg ctt cga no 
Met Glu Val Ala Ala Ala Thr Ala Thr Ser Phe Thr Thr Leu Arg 

5 10 15 



10 K S ?S2 ?f 9 aC ° CCg tCt tc!t aca C 9 C aat ctff aga bet 158 
Ala Arg Thr ser Ala He l le Pxo Ser Ser xhr Arg Aan Leu Arg S r j8 

20 25 3 0 

206 



aaa ccg aga ttt bet tea tct tea tct etc aga get tat ctt tee, M <- 
lye Pro Arg Phe Ser Ser Ser Ser Ser Leu Arg E ser Leu Ser £ 
35 40 45 

» s ss s s s s if; ir? s s e s ^ s: ss - 

20 35 50 

Ilv E S? S!l 9 f 9 9 f a CCg att aat C °3 tta ttt tec age 302 

Gly Ala Lya Leu Arg Ala Glu Ser Leu Aan Pre Leu Asn Phe Ser Ser 

70 7g 



25 



Ser ^ SS * 9 ° 9 ? a SCt 9CC acfc at 9 3 fct a * a cct ttc tea aag 350 

Ser Lya Pro Lye Arg Gly Val Val Thr Met Val lie Pro Phe Ser Lya 

85 90 ss 

30 2K 1^ 2S £ aC Sf a °f a cct oct cct arac ttg gca tea tat ttg ttc 39a 
Gly ser Ala His Glu Gin Pro Pro Pro Asp Leu Ala Ser Tyr Leu Phe 



35 



100 105 " 110 

f* 9 aaC ^ 9a a ? C 91:3 cat tb Sf 99 a at 9 tct oto gta ccc tea ott acfc a** 

I.ye Aan Arg lie Val Tyr Leu Gly M Ser Leu val ^ £ g ^ a ^ 446 

115 120 12S 



S § S" S S 5 « K S S S| K JS K S 

230 235 



542 



590 



a ss s k a s ss g; ss ss £ k s k e « 

40 135 140 

5? Pro E Tvr SS MC tC9 aCt 993 aaa acG as, 9 aaC AW 

SS ^ le Asn Ser Thr 01 * Thr L ys ash Qiy 

145 150 155 J * 

45 Glu LCf SS r? C Sf 6 9at ^ b 939 9Cfc fctt 9ca at< = tat gat gte atg 
Glu Lya Leu Gly Tyr Aep Thr Glu Ala Phe Ala He Tyr Lp Val Met 

165 170 X75 

« ss s; e s: s s a sj s a e c s 5 638 

160 185 „ 0 

r?£ ff C T tt9 Ctt Ct9 acC 9° fc 99 c 9<=a aaa gga aat cga tot See 

5s Gly Glu Ala Ala Leu Leu Leu Thr Ala Gly Ala Lys Sly Asn Jg Ser 

195 200 205 

? tS tCa acC att atg ata a *9 oa 9 ccc att get OQa ttt -it a 

Ala Leu Pro Ser Ser Thr He MeC xle Lya Gin Pro He E ArJ P^e 

60 215 220 



7S2 



T3:^T COOZ'21'20 L80 



20 



25 



40 
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ata aag aca gaa atg gto aag ctg tat tea aag cat act ggt aaa tec 

lie Lys Thr Glu Met Val Lya Leu Tyr Ser Lys His lie Gly Lye Ser 

245 250 255 



30 <23.Q> 16 

<211> 305 

^2X2 > PKT 

<213> Arabxdopsis thaliana 

35 <400> 16 



Met Glu Val Ala Ala Ala Thr Ala Thr Ser Phe Thr Thr Leu Arg Ala 
1 5 10 is 

Arg Thr Ser Ala lie He Pro Ser Ser Thr Arg Asn Leu Arg Ser Lya 
50 25 30 



45 Pro Arg Phe Ser Ser Ser Ser Ser Leu Arg Ala Ser Leu Ser Asn Gly 
35 40 45 

Phe Leu Ser Pro Tyr Thr Gly Gly Ser He ser Ser Asp Leu Cys Gly 
0U 50 55 60 



55 



60 



Ala Lys Leu Arg Ala Glu Ser Leu Asn Pro Leu Aen Phe Ser Ser Ser 
65 . 10 75 80 

Lya Pro Lya Arg Gly Val Val Thr Met Val He Pro Phe Ser Lya Gly 
8* 90 95 

Ser Ala Glu Gin Pro Pro Pro Aap Leu Ala Ser Tyr Leu Phe Lya 
100 105 no 



630 



O ccg gag cag att gaa get gac acg aaa cgc ccg aaa cat ttt agt ccc 87S 
Pro Glu Gin lie Glu Ala Asp Met Lys Arg Pro Lya Tyr Phe Ser Pro 
260 265 270 

act gag gat get gaa tat ggg acc att gat aag gtg gtt tac aat gaa 926 
1U Thr Glu Ala Val Glu Tyr Gly He He Asp Lys Val Val Tyr Asn Glu 
275 280 285 

agg ggc age caa gac aga gga gtt gtg bet gac ctt aaa aag gca caa 974 
Arg Gly ser Gin Asp Arg Gly Val Val ser Asp Leu Lys Lye Ala Gin 
590 295 300 



etc att tga atgtoagaac tgtcttccga aaccccatga ttaa^aagtc 1023 
Leu He 
305 

ggagatctta ccgctgacca aatggggaat cagtgaacca ttcaccggca cagaactgag 10B3 

gtaaagtctg gaaaaeatgt taaaaaaggt tactagtaat getgeaattg tagggctatt 1143 

tgaacagaaa caaacccata tgtgtaggct tgtgaatgcc cagaaacagg actggtgtat 1203 

cttcaatata tgtttctaag atgaatcaat ttc 123 G 
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Asn Arg lie Val Tyr Leu Gly Met Ser Leu Val Pro Ser Val Thr- Glu 
Xl5 120 125 

5 Leu lie Leu Ala Glu Phe Leu Tyr Leu Gin Tyr Glu Asp Glu Glu Lys 
130 135 140 

<in I:L ® ^ Leu ^ Ile Asn 5er * hr Gl y Thr Thr L ys Aen Gly Glu 

10 145 150 155 ' 160 



Lys Leu Gly Tyr Asp Thr Glu Ala Phe Ala lie Tyr Asp Val Met Gly 

15 



1*5 170 175 



Tyr Val Lya Pro Pro lie Phe Thr Leu Cye Val Gly Asn Ala Trp Gly 
180 185 " 190 

20 

Glu Ala Ala Leu Leu Leu Thr Ala Gly Ala Lya Gly Asn Arg Ser Ala 
195 200 205 

25 Leu Pro Ser Ser Thr He Met He Lys Gin Pro He Ala Arg Phe Gin 
21° 215 220 

-an ?l y Gln Ala Aep Val Glu Ile Ala L 7 a Glu He Lye His He 

JU 225 230 235 ~ 2 4 o 

Lya Thr Glu Met Val Lys Leu Tyr Ser Lys His lie Gly Lye Ser Pro 
35 245 250 2S5 



40 



Glu Gin lie Glu Ala Aap Met Lye Arg Pro Lys Tyr Phe Ser Pro Thr 
260 265 270 

Glu Ala Val Glu Tyr Gly He He Asp Lys Val Val Tyr Asn Glu Arg 
275 280 285 



45 Gly Ser Gin Aap Arg Gly Val Val Ser Aep Leu Lys Lys Ala Gin Leu 
2 *° 295 300 

He 
50 305 



<2l0> 17 

<21l> $qG 

55 <212> DNA 

<;213> Nicotiana tabacum 

<220> 

<221> CDS 

60 <222> (45) . . (755) 
<223> 



<400> 17 



VT C002*ZT - 20 680 
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gcggccgctc caagatteat ccceaactct caacacattc aact atg cga acc caa 56 

Met Arg Thr Gin 
1 





att 


gbt 


cac 


a^a cue CLL 


aac 


cga 


aga 


acc 


aac 


gga 


acc 


cct 


t- y- 
uug 


aat 


JLU u 




lie 


Val 




T.-vr/a Ti^>n DWa 
JsJ jr d Jjcsu. lies 








Tip 

A it? 


Asn 


ux y 




Urn 


J-IC 1,1 


Ann 






5 




























10 


age 


age 


aag 


aga ttt tat 


ggg 


gta 


ata 


cca 


atg 


gta 


ata 


gag 


cac 


tct 


152 


Ser 


Ser 


Y Area 




V3-Ly 




Yin* 


Pro 












Ser 






























35 








tea 


aga 


gga 


gaa agg get 


tat 


gac 


ata 


ttc 


tea 


agg 


eta 


tta 


aag 


gaa 


200 


1 f=\ 

I v> 


Ser Arg 


/NT 

Gly 


Glu Arg Ala 


Tyr 


Aap 


lie 


Pile 


Ser 


Arg 


Leu 


Leu 


Xiye 


Glu 










40 






45 










50 










cga 


att 


atfc 


tgc att aac 


ggc 


ccc 


att 


gat 


gat 


tec 


act 


tct 




gtt 


243 




Arg 


He 


lie 


Cya lie Aen 


Gly 


Pro 


x ie 


Asp 


TV 

Aap 


Ser 


Thr 


Ser 


Hie 


Val 










55 






60 










65 










20 


































gtt gtt 


get 


cag ctt ett 


ttt 


ctt 


gaa 


tct 


gag 


aac 


cct 


tct 


aag 


cct 


396 




Val 


Val 


ax a 


Gin Leu Leu 


pne 


Leu 


Glu 


Ser 


Glu 


Asn 


Pro 


Ser 


Lya 


Pro 








70 






75 










80 












att 


cac 


aag 


tac etc aac 


tct 


cca 


ggt 


ggc 


get 


gtt 


aca 


get 


ggt 


ctt 






He 


Hie 


Lya 


Tyr Leu Aan 


Ser 


Pro 


Gly 


Gly 


Ala 


Val 


Thr 


Ala 


Gly Leu 






85 






90 










95 










100 




ou 


gca 


ate 


tat 


gat acc acg 


cag 


tat 


ate 


cga 


tct 


cca 


act 


cat 


act 


ata 


392 


Ala 


He 


Tyr 


Aap Thr Thr 


Gin 


Tyr 


lie 


Arg 


Ser 


pro 


lie 


His 


Thr 


He 












105 








inn 

110 










115 








tgc 


eta 


ggt 


caa gca got 


tea 


atg 


gga 


tec 


ctt 


etc 


tta 


get 


gca 


ggt 






Cya i,eu 


\?j.y 


ta-ill AX Aid 


Ser 




t?j.y 


Ser 


L@U 


ijeu 


Leu 


TV 1 a 


Ala Gly 


























1j U 










gca 


aag 


ggt 


gag aga cga 


tct 


etc 


cct 


a at 


get 


tea 


gtt 


atg 


att 


cac 


488 




Ala Lys 


vaxy 


LzJLVL iuy nly 


Ser 


Leu 




Asn 






Val 


L"lc= U. 


He 


Hi a 


























t a a 










40 


































cag 


cet 


ttc 


ggt ggg tat 


age 


ggg 


cag 


get 


aaa 


gat 


ttg 


acg 


ate 


cac 






Gin 


Pro 


Phe 








Gin 


Ala 


T.vn 


Aap 




Thr 


lie 


Hi a 








150 






JL -J — J 










JL O V 












45 


aca 


aaa 






gt a 




gat 


act 


ttg 


aat 




Cta 


tat 


gca 


584 




Thr 




Gin 


He Val Arg 


Val 


Tro 


Asp 


Thr 


Leu 


Asn 


Asp 


Leu 


Tyr Ala 






165 






170 










175 










ieo 




50 


aag 


cat 


aea 


gga caa cct 


ata 


gaa 


ata 


att 


caa 


aag 


aat 


atg 


gat 


agg 


632 


Lya 


His 


Thr 


Gly Gin pro 


He 


Glu 


He 


He 


Gin 


Lya 


Aen 


Met 


Aap Arg 












1B5 








ISO 










195 








gat 


tac 


ttc 


atg aea act 


gaa 


gag 


gcg 


aag 


gag 


ttt 


gga 


ata 


ate 


gat 


680 




Asp Tyx 


Phe 


Met Thr Pro 


Glu 


Glu 


Ala 


Lya 


Glu 


Phe 


Gly 


He 


He Aap 




55 








200 






205 










210 










gaa 


gtt 


ata 


gat gaa cga 


cca 


atg 


get 


tta 


gta 


act 


gat 


get 


gtt 


gca 


728 




Glu 


Val 


He 


Asp Glu Arg 


Pro 


Met 


Ala 


Leu 


Val 


Thr 


Asp 


Ala 


Val 


Ala 










215 






250 










225 










60 


































aat 


gaa 


gec 


aaa gaa aaa 


ggt 


tea 


age 


tagaaaaatt gctgtaatac 




775 




Aan 


Glu 


Ala 


Lya Glu Lys 


Gly 


Ser 


Ser 






















230 






235 
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tgatctcatt gcagcctttg ttagcattta ccatcgotaa ctagttetcc attfctaetrta 635 

ctggtgtafct tactttctag tatettattt gatgaggcga tacctoa^ta ctttgttttc BBS 

5 fcaagcggccg c 90$ 

«210> IB 

<211> 237 

10 <212> PRT 

«c2l3> JWicotiana tabacum 

«£400> 18 

15 Met Arg Thr Gin lie Val Hi© Lys Leu Phe Asn Arg Arg He Asn Gly 
15 10 15 

Thr Pro Leu Asn Ser Ser Lye Arg Phe Tyr Gly Val He £>ro Met: Val 
20 20 25 30 



25 



30 



45 



SO 



He Glu Hi© Ser Ser Arg Gly Glu Arg Ala Tyr Asp He Phe Ser Arg 
35 40 45 



Leu Leu Lys Glu Arg He He Cys He Asn Gly Pro He A8p Asp Ser 
50 55 60 



Thr Ser His Val Val Val Ala Gin Leu Leu Phe Leu Glu Ser Glu Aen 
65 70 75 60 



35 pro Ser Lys Pro He His Lys Tyr Leu Asn ser Pro Gly Gly Ala Val 

85 90 95 

Thr Ala Gly Leu Ala He Tyr Asp Thr Thr Gin Tyr He Arg Ser Pro 
40 100 105 110 



He Hi,e Thr tie Cys Leu Gly Gin Ala Ala Ser Met Gly Ser Leu Leu 
115 120 125 



Leu Ala Ala Gly Ala Lye Gly Glu Arg Arg Ser Leu Pro Asn Ala Ser 
130 135 140 



Val Met He Hie Gin Pro Phe Gly Gly Tyr Ser Gly Gin Ala Lys Asp 
145 150 155 160 



55 Leu Thr He His Thr Lye Gin He Val Arg Val Trp Asp Thr Leu Asn 

165 170 175 

Aep Leu Tyr Ala Lys Hi© Thr Gly Gin Pro He Glu He He Gin Lye 
60 180 185 190 



Asn Met Aep Arg Asp Tyr Phe Met Thr Pro Glu Glu Ala Lys Glu Phe 
19$ 200 205 



frT eO02*2T*Z0 160 
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10 



15 



26 



Gly lie Xle Asp Glu Val lie Asp Glu Arg Pro Met Ala Leu Val Thr 
210 215 220 

Asp Ala Val Ala Aan Glu Ala Lye Glu Lye Gly Ser Ser 
225 230 235 



<210> 19 

<2ll> 447 

<212> DNA 

<213> Nicotiana tabaaum 





<400> 19 

gcggccgctt gcggacaaga taatcgattc 


aagagacaat gtatttgaga aaaggaacta 


60 


20 


tgatgagata 


ctcgcccaat 


Ctagagctat 


gaggaaagcc ggaccaggtg ctcaggctgc 


120 


tccatctggc 


ttcaggtgac 


tggaagagcg 


gtaatggtcc caaaccttca ggaacaactg 


180 




tfcgttccctt 


atagtttcga ggaacaaagt 


tgctggttac ttggtctgtg ccggtataat 


240 


25 


gtaactggga caaagaacat attgtagaaa 


ccttgtttga gctgtgaagt ataggggtct 


300 




tacaactatt 


atgcacaggt 


ctgcaaagag 


tacccataat gtcaattggt tgtaccagta 


360 


on 
oU 


tcaaacaatc 


agatagtgcc 


agtgtatggt 


ataaatgaat atagatctct ctgatgtcat 


420 


ttttctttta 


tcatgttcag 


cggccgc 




447 


35 


«210> 20 
<211> 995 
<212> DNA 

<213> Nicotiana tabacum 






40 


c400> 20 

gcggccgctt 


gcggacaaga 


taatcgartc 


aagagacaat gtatttgaga aaaggaacta 


60 




tgatgagata 


ctcgcccaat 


Ctagagctat 


gaggaaagcc ggaccaggtg ctcaggetgc 


120 


45 


tccatctggc 


ttcaggtgac 


tggaagagcg 


gtaatggtcc caaactttca ggaacaactg 


150 


ttgttccctt 


atagtttcga 


ggaacaaagt 


tgctggttac ttggtctgtg ccggtataat 


240 




gtaactggga 


caaagaacat 


attgtagaaa 


ccttgtttga gctgtgaagt ataggggttt 


300 


50 


tacaactatt 


atgcacaggt 


ctgcaaagag 


tacacataat gtcaattggt tgtaccaggc 


360 




ggccgctggc 


ttcttctttg 


attctctctc 


cgctttctag ctcgacggtt actgaaaatc 


420 


55 


gcgagctggg 


ttctggtaaa 


ccaactttca 


tatccagtcc caatttctac tttgcaactt 


460 


ctgttcacag 


tcgcaggcca 


aacggcgttc 


gaggttattg tcacaggtct ccggtagcca 


S40 




agtctttgga 


ccatataccc 


caaaaattca 


gactggaaaa tctcaaagat ggactactgg 


600 


60 


acaactataa 


aagtgcccct 


cagtatcttt 


acggccttag tccttcaaag atggatatgt 


660 




tcatgacaga 


agataaacca 


gtaoggcgac 


agtcagaaag tgccaccgag gacagtacat 


720 




Ctgcggcc^c 


tggcagatgc 


tccacgaarg 


gataccacag caggtcttgg ttagtcoata 


7B0 
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30 



60 



27 

cacatagtat aatttatggc tgatagtggt cgtacgactt gcagtgtcat tttgcaattt 840 

cttttgttta atctacatat tgaactcttt tgatctactt attcaaaaac atgaaatcct 900 

gagcagacfca gatgcatbbg tttaatabca tgaatgcaag gaatccacct acagctgaba 960 

tgtatacaaa gatacctttt tttcaagagc ggccgc 996 



5 



10 



15 



20 

g egg ccg ctg gaa gat gcg egg cgc caa gtg aac gaa ggg get caa cct 49 

Arg Pro Leu Glu Asp Val Arg Arg Gin val Asn Glu Ala Val Gin Pro 
25 1 5 10 IS 

cgt cat aaa ate gac aag atg tat gtc gec ttt act gaa caa cca att 97 
Arg His Lye He Asp Lys Met Tyr val Ala J?*ie Thr Asp Gin Pro He 
20 25 30 



<2X0> 


21 


<211> 


$02 


<212> 


DNA 


<213> 


Nicotiana tabacum 


<220> 




<22l> 


CDS 


<222> 


(2) (193) 


<223> 




<400> 


21 



gag aag gtg caa cag tac act gaa agg gat cgt ttt fctg tct gfce tea 145 
Glu X.ys Val Glu Gin Tyr Thr Glu Arg Asp Arg Phe Leu Ser Val Ser 
35 40 45 



35 gag gee atg gag ttt ggb etc ata gat ggg gtg eta gaa aca gaa tac 193 
Glu Ala Met Glu Phe Gly Leu lie Asp Gly val Leu Glu Thr Glu Tyr 
50 55 $0 

taert 

40 



45 



tagttgcaaa 


tgaatcttta 


gtagtacatg 


gtagctagee 


ttccaacgac gaaaaagctg 


253 


gtgttgctca 


tcaaccactt 


cgaagtacaa 


gaagctggct 


ettgeaaatt fcgtatcgtag 


313 


aaatatctca 


actcttcaat 


ccaggaatgt 


ccaaaagcct 


aattctgaag aeggttatag 


373 


aaagegctet 


tgttttacta 


tttttgtctc 


tectgeagat 


acactcagaa cttttgtggg 


433 


tattaatcag 


ggtcttaatc 


catcacttaa 


ccacaatcca 


gttggaageg aagtgatcaa 


493 


acacaa&gca 


gattcaggaa 


gatigtgtatt 


tttcccaaat 


atatattact ecaattgeta 


553 


tcatcccttc 


gctgtcgtta 


tgaaaggata 


tttattttat 


agcggccgc 


602 



50 



-:210? 22 

55 <211> 64 

<212> PRT 

<213> Nicotiana tabacum 



<400> 22 

Arg Pro Leu Glu Asp Val Arg Arg Gin Val Asn Glu Ala Val Gin Pro 
15 10 15 



£Z^l eOOZ'ZT'ZO £60 



BASF Aktlengasellschaft 20030949 PF 551 1 7 EP 

28 

Arg His Lys He Asp Lys Met Tyr Val Ala Phe Thr Asp Gin Pro He 
20 25 30 

5 Glu Lye val Gin Gin Tyr Thr Glu Arg Aap Arg Phe Leu Ser Val Ser 
35 40 45 

Glu Ala Met Glu Phe Gly Leu He Asp Gly Val Leu Glu Tfrr Glu Tyr 
10 50 55 60 



<210> 23 

15 <212> DMA 

<213> artificial sequence 



20 



35 



60 



<220> 

<223 > primer 
<400> 23 

agaafctc^cg gccgct 16 



25 <210> 24 

<211> 32 

<212> DMA 

<213> artificial sequence 

30 <220> 

<223> primer 



<400> 24 

ctcatgeggc cgcgcgcaac gcaattaatg tg 32 



<210> 25 

<211> 32 

<212> DNA 

40 <213> artificial sequence 

*220* 

<223> primer 

45 <400> 25 

tcafcgcggcc gcgagatcca gfcfccgatgta ac 32 

<210> 26 

50 <2X1> 21 

<212> DNA 

<213> artificial sequence 

*220> 

55 «223> primer 

c400> 26 

gtggattgat gtgatatcco c 2i 

<210> 27 

<211;> 21 

*212> DNA 

<2i3* artificial se<D£uence 



VZ : £Z*VI £0QZ'ZX m Z0 fr60 



BASF AkttengeseHschaft 



20030949 



PF 55117 EP 



29 



<220> 

<2Z3> primer 
<400> 27 

gtaaggatct gagctacaca t 



21 



10 



15 



20 



25 



30 



<21Q> 28 

<211> 24 

<212> PWA 

<213> artificial sequence 
<;220> 

<223:> primer 

<:400> 28 

tafcaccatgg atttgccatc tttg 



<210> 29 

<211> 21 

<212> DNA 

<213> artificial sequence 
<2Z0> 

<223> primer 

<£400> 29 

atagacctca cctggagcca g 



24 



21 



35 



40 



<210> 30 

<2ll> 19 

<212* DNA 

<213> artificial sequence 
<220> 

<223* primer 

<400> 30 

gancccatgg caagaggag 



19 



45 



50 



55 



<2X0> 31 

<2ll> 22 

<212> DNA 

<213> artificial sequence 
<220> 

<223 > primer 

<400> 31 

acagatcttt ctagcttgaa cc 



22 



<210> 32 

<211> 21 

<212> DNA 

60 <213> artificial sequence 

<220> 

<223> primer 



£Z:frT e00£*£T"£0 960 



96 N31I3S ±WdS3G 

BASF Aktiengeseltechaft 20030949 PF 551 1 7 EP 



15 



<400> 32 

tcagccatgg ccccfcggagg a 



<210> 33 
<2ll> 24 

<313> artificial eeCjuence 



10 <220> 

<223> primer 



<400> 33 

taagatcttc agtattcfcgt ttcc 



30 

21 



24 



9fr-£Z : n e002'ZT"30 960 
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